Data Engineering

Slowly Changing Dimensions: Types 1-6 Explained with Examples

Saad Selim

May 4, 2026

10 min read

Slowly Changing Dimensions (SCD) are dimension records in a data warehouse that change over time. A customer's address changes. An employee's department changes. A product's price changes. How you handle these changes determines whether your historical analysis is accurate.

The Problem

In a transactional database, you update the record and move on. But in a data warehouse used for analytics, you need to answer questions like:

"What was this customer's segment WHEN they placed that order?"
"How much revenue did the Northeast region generate last year?" (even though some accounts have since moved to a different region)

If you simply overwrite dimension values, you lose the ability to answer these historical questions accurately.

SCD Types

Type 0: Retain Original (Never Change)

The dimension value is fixed at the time of initial load and never updated, regardless of source changes.

Use case: Original customer acquisition channel, birth date, employee hire date.

Example: Customer's "original_signup_source" stays "Google Ads" forever, even if they later discover you through other channels.

Type 1: Overwrite (No History)

Replace the old value with the new value. No history preserved.

customer_id	name	city
101	Alice	New York

Alice moves to Boston. After Type 1 update:

customer_id	name	city
101	Alice	Boston

Use case: Corrections (fixing typos), attributes where history does not matter (email format standardization).

Pros: Simple, no extra storage, no complexity. Cons: Cannot analyze historical state. "Revenue by customer city" will attribute all of Alice's past orders to Boston.

Type 2: Add New Row (Full History)

Insert a new row for the dimension record with effective dates. The old row is marked as expired.

customer_id	name	city	effective_from	effective_to	is_current
101	Alice	New York	2024-01-15	2026-03-01	false
101	Alice	Boston	2026-03-01	9999-12-31	true

Use case: Any attribute where historical accuracy matters. Customer segment, territory assignment, product category.

Pros: Complete history, accurate point-in-time analysis. Cons: Table grows larger over time, joins require date range logic, more complex ETL.

Query pattern for Type 2:

-- Get customer's city at the time of their order
SELECT o.order_id, o.amount, c.city
FROM orders o
JOIN customers c ON o.customer_id = c.customer_id
    AND o.order_date >= c.effective_from
    AND o.order_date < c.effective_to;

Type 3: Add New Column (Limited History)

Add a column for the previous value. Only keeps one generation of history.

customer_id	name	current_city	previous_city	city_change_date
101	Alice	Boston	New York	2026-03-01

Use case: When you only need the current and immediately prior value (not full history).

Pros: No table growth, simple schema, no date range joins. Cons: Only one prior version kept. Subsequent changes overwrite the previous column.

Type 4: History Table (Separate)

Keep the current value in the main dimension table and store all historical changes in a separate history table.

Main table (current):

customer_id	name	city
101	Alice	Boston

History table:

customer_id	city	changed_from	changed_to
101	New York	2024-01-15	2026-03-01
101	Boston	2026-03-01	NULL

Use case: When the main dimension table must stay small and fast, but history is needed for auditing.

Pros: Main table stays compact, history available when needed. Cons: Requires joining to history table for point-in-time analysis.

Type 5: Type 4 + Type 1 (Mini-Dimension)

Combines a mini-dimension (frequently changing attributes) with the main dimension. The main dimension has a foreign key to the current mini-dimension row.

Use case: Large dimensions with a few rapidly changing attributes (e.g., customer demographics where income bracket and age group change frequently).

Type 6: Hybrid (Type 1 + 2 + 3 Combined)

Combines Type 1 (current value in every row), Type 2 (history rows), and Type 3 (previous value column):

surrogate_key	customer_id	current_city	historical_city	effective_from	effective_to	is_current
1001	101	Boston	New York	2024-01-15	2026-03-01	false
1002	101	Boston	Boston	2026-03-01	9999-12-31	true

Note: current_city is always the LATEST value (updated across all rows via Type 1). historical_city is the value that was active during that row's effective period.

Use case: When you frequently need both current state and historical state in the same query.

Choosing the Right Type

Scenario	Recommended Type
Corrections and data cleaning	Type 1
Attributes with no analytical history need	Type 1
Anything requiring point-in-time accuracy	Type 2
Only need "before and after" comparison	Type 3
Fast lookups + audit trail	Type 4
Large dimensions with mini-dimension pattern	Type 5
Complex analytical needs (current + historical)	Type 6

Implementation in dbt

Most modern data warehouses use dbt for SCD Type 2:

-- dbt snapshot (implements Type 2 automatically)
{% snapshot customer_snapshot %}
{{
    config(
      target_schema='snapshots',
      unique_key='customer_id',
      strategy='timestamp',
      updated_at='updated_at',
    )
}}
SELECT * FROM {{ source('raw', 'customers') }}
{% endsnapshot %}

dbt snapshots automatically track changes, add effective_from/effective_to columns, and maintain the is_current flag.

Performance Considerations

Type 2 tables grow. A customer dimension with 1M customers where 10% change quarterly will have 1.4M rows after a year. Plan storage and indexing accordingly.
Index effective dates. Queries on Type 2 dimensions always filter on date ranges. Index (business_key, effective_from, effective_to).
Partition by is_current. Most queries only need current records. Partition or create a view for current-only access.
Surrogate keys. Type 2 requires surrogate keys (auto-incrementing IDs) because the natural business key is no longer unique.

Summary

Type	Approach	History?	Complexity
0	Never update	Original only	Minimal
1	Overwrite	None	Low
2	New row with dates	Full	Medium
3	Previous value column	One generation	Low
4	Separate history table	Full (separate)	Medium
5	Mini-dimension	Partial	High
6	Hybrid (1+2+3)	Full + current	High

Type 2 is the most common in practice because it balances complexity with analytical power. Use Type 1 for attributes that do not matter historically. Reserve Types 5-6 for edge cases that demand both performance and complete history.

Share this article

Saad Selim

The Skopx engineering and product team

Slowly Changing Dimensions: Types 1-6 Explained with Examples

The Problem

SCD Types

Type 0: Retain Original (Never Change)

Type 1: Overwrite (No History)

Type 2: Add New Row (Full History)

Type 3: Add New Column (Limited History)

Type 4: History Table (Separate)

Type 5: Type 4 + Type 1 (Mini-Dimension)

Type 6: Hybrid (Type 1 + 2 + 3 Combined)

Choosing the Right Type

Implementation in dbt

Performance Considerations

Summary

Share this article

Saad Selim

Related Articles

Data Modeling Tools: Top Options for 2026 Compared

Reverse ETL: What It Is, Why You Need It, and How It Works

Data Manipulation: Techniques, Tools, and Best Practices

Data Preprocessing: Steps, Techniques, and Why It Matters

Modern Data Stack: What It Includes and Where It Is Heading in 2026

Conversational Intelligence Software: What It Is and Who Needs It

Stay Updated