Data Vault 2.0 Blog

Guides, tutorials, and best practices for building modern data warehouses with dbt and Data Vault 2.0.

Deep Dive

Hub or Link Satellite? When Your Entity Has No Business Key

The business key rule with a real FMCG order line example, the fan-out trap that silently loses rows, and a decision framework for hub vs link satellite.

April 16, 2026 15 min read
Tech Stack

dbt + Airflow + Data Vault — Our Tech Stack Explained

How we build Data Vault 2.0 pipelines using dbt, Apache Airflow, and PostgreSQL. A short overview of our tech stack, with a link to a live demo.

March 28, 2026 5 min read
Best Practice

Why You Should Re-run DAGs, Not Create New Ones

When source data is corrected, re-run the original DAG run — don't trigger a new one. Learn how LOAD_DATETIME, EFFECTIVE_FROM, and HASHDIFF work together for a complete audit trail.

March 15, 2026 10 min read
Tutorial

PIT Tables in Data Vault 2.0: How to Query Satellites Without Duplicates

Why joining satellites directly produces duplicate rows, how to fix it with ROW_NUMBER(), and how to build Point-in-Time (PIT) tables for clean, fast mart queries.

March 11, 2026 12 min read
Deep Dive

Why Data Vault 2.0? Real Business Problems It Solves That Other Architectures Can't

Seven real business requests from banking, e-commerce, and healthcare that Data Vault answers but Kimball and 3NF cannot, with SQL examples.

March 11, 2026 15 min read
Tutorial

How to Integrate Airflow with dbt for Data Vault 2.0

Step-by-step guide to orchestrating Data Vault 2.0 pipelines with Apache Airflow and dbt. Covers DAG setup, load datetime variables, incremental loading, and backfills.

March 10, 2026 15 min read
Guide

What is Data Vault 2.0? A Complete Guide

Everything you need to know about Data Vault 2.0 methodology — what it is, why it exists, and how it compares to Kimball and Inmon approaches.

March 10, 2026 10 min read