Data Vault 2.0 Blog

Guides, tutorials, and best practices for building modern data warehouses with dbt and Data Vault 2.0.

Hub or Link Satellite? When Your Entity Has No Business Key

The business key rule with a real FMCG order line example, the fan-out trap that silently loses rows, and a decision framework for hub vs link satellite.

April 16, 2026 15 min read

Best Practice

Why You Should Re-run DAGs, Not Create New Ones

When source data is corrected, re-run the original DAG run — don't trigger a new one. Learn how LOAD_DATETIME, EFFECTIVE_FROM, and HASHDIFF work together for a complete audit trail.

March 15, 2026 10 min read

Tutorial

PIT Tables in Data Vault 2.0: How to Query Satellites Without Duplicates

Why joining satellites directly produces duplicate rows, how to fix it with ROW_NUMBER(), and how to build Point-in-Time (PIT) tables for clean, fast mart queries.

March 11, 2026 12 min read

Deep Dive

Why Data Vault 2.0? Real Business Problems It Solves That Other Architectures Can't

7 real business requests from banking, e-commerce, and healthcare that Data Vault answers — but Kimball and 3NF cannot. With SQL examples.

March 11, 2026 15 min read

Tutorial

How to Integrate Airflow with dbt for Data Vault 2.0

Step-by-step guide to orchestrating Data Vault 2.0 pipelines with Apache Airflow and dbt. Covers DAG setup, load datetime variables, incremental loading, and backfills.

March 10, 2026 15 min read

Guide

What is Data Vault 2.0? A Complete Guide

Everything you need to know about Data Vault 2.0 methodology — what it is, why it exists, and how it compares to Kimball and Inmon approaches.

March 10, 2026 10 min read