Data Vault 2.0 Blog
Guides, tutorials, and best practices for building modern data warehouses with dbt and Data Vault 2.0.
Hub or Link Satellite? When Your Entity Has No Business Key
The business key rule with a real FMCG order line example, the fan-out trap that silently loses rows, and a decision framework for hub vs link satellite.
dbt + Airflow + Data Vault — Our Tech Stack Explained
How we build Data Vault 2.0 pipelines using dbt, Apache Airflow, and PostgreSQL. A simple overview of our tech stack, with a link to a live demo.
Why You Should Re-run DAGs, Not Create New Ones
When source data is corrected, re-run the original DAG run — don't trigger a new one. Learn how LOAD_DATETIME, EFFECTIVE_FROM, and HASHDIFF work together for a complete audit trail.
PIT Tables in Data Vault 2.0: How to Query Satellites Without Duplicates
Why joining satellites directly produces duplicate rows, how to fix it with ROW_NUMBER(), and how to build Point-in-Time (PIT) tables for clean, fast mart queries.
Why Data Vault 2.0? Real Business Problems It Solves That Other Architectures Can't
Seven real business requests from banking, e-commerce, and healthcare that Data Vault can answer but Kimball and 3NF cannot, with SQL examples.
How to Integrate Airflow with dbt for Data Vault 2.0
Step-by-step guide to orchestrating Data Vault 2.0 pipelines with Apache Airflow and dbt. Covers DAG setup, load datetime variables, incremental loading, and backfills.
What is Data Vault 2.0? A Complete Guide
Everything you need to know about the Data Vault 2.0 methodology: what it is, why it exists, and how it compares to the Kimball and Inmon approaches.