Downstream Lineage: Impacts
Answer the question: "What breaks if I change this?"
Impact Analysis
Step 1: Identify Direct Consumers
For tables:
- List DAGs using
list_active_dagsandlist_paused_dags - Search DAG source using
get_dag_source_codefor table references
For DAGs:
- Determine outputs from DAG source
- Trace consumer DAGs that read those outputs
Step 2: Build Dependency Tree
Example:
SOURCE: fct.orders
|
+-- TABLE: agg.daily_sales -> Dashboard: Executive KPIs
+-- TABLE: ml.order_features -> Model: Forecasting
Step 3: Categorize by Criticality
- Critical: production dashboards, customer-facing apps
- High: internal ops dashboards, downstream ETL
- Medium: ad-hoc analysis
- Low: deprecated or test data
Step 4: Assess Change Risk
Consider:
- Schema changes
- Data timing changes
- Deletion or deprecation
Step 5: Find Stakeholders
- DAG owners in DAG definitions
- Dashboard owners in BI tools
Output: Impact Report
Include:
- Summary of downstream assets
- Impact diagram
- Detailed impacts and owners
- Risk assessment and mitigations
Related Skills
- tracing-upstream-lineage
- checking-freshness
- debugging-dags
- annotating-task-lineage
- creating-openlineage-extractors