Auto CDC (apply_changes) in Spark Declarative Pipelines
The apply_changes API enables processing Change Data Capture (CDC) feeds to automatically handle inserts, updates, and deletes in target tables.
Key Concepts
Auto CDC in Spark Declarative Pipelines:
- •Automatically processes CDC operations (INSERT, UPDATE, DELETE)
- •Supports SCD Type 1 (update in place) and Type 2 (historical tracking)
- •Handles ordering of changes via sequence columns
- •Deduplicates CDC records
Language-Specific Implementations
For detailed implementation guides:
- •Python: auto-cdc-python.md
- •SQL: auto-cdc-sql.md
Note: The API is also known as applyChanges in some contexts.