Lineage Trace
This method of analysis presents either graphical or textual representation of the flow of data through connection definitions to data stores and physical transformation rules which transform and move the data. In order to see data flow lineage, one must
- Define a configuration that contains all of the models potentially in the data flow
- Stitch the models together by resolving connection definitions and Build the configuration
Once the configuration is ready, then you are ready to report on lineage.
In the Data Lineage Diagram, all columns/fields of a given table/file are presented at once which matches the classic data modeling concepts. Selection of a given column/field allows a user to highlight the data flow to it.
End-to-end data flow lineage across models is only available at the classifier (e.g., table) and feature (e.g., column) level. If instead one goes to the object page for a schema or model, as this is not classifier or feature, the data flow tab shows the overview lineage within the scope of that model only.
A data flow lineage trace presents summary lineage as opposed to the data flow overview lineage which presents a step by step transformation lineage.
When you trace impact/lineage of a table or column, you do not see all the transformations. Instead, you see a summary of the whole job (you get a picture much closer to the one for an architecture diagram). But, you are also able to see complete end-to-end lineage (not just confined to one DI or BI model).
Finally, the tool does not display constants on the lineage diagram. In particular, this means that if a constant appears as a source for lineage and that process only has that constant as a source for a lineage trace, you will not see that process in the lineage trace.
Steps
- Sign in as a user which has at least the Metadata Viewing capability object role assignment to the configuration and all its contained models.
Without the Metadata Viewing capability object role assignment to all the configuration’s contained model, you will see a dialog indicating that you do not have sufficient privileges.
- Find a starting point for lineage by either
- Navigate to that element’s object page and select the Lineage tab
- Or, for lists of elements, click the line the element is on and click the appropriate Open Lineage icon
- Or, right click on the element in a diagram (architecture diagram, lineage diagram or model diagram) and select Open Lineage
- From here you may
- Use the Lineage Trace Header Options including specify the lineage presentation
- Use the Diagram Visualization Common Features, including Switch to
- Data Lineage - trace from an object upstream to objects that provide data flow to that object
- Data Impact - trace from an object downstream to objects that are impacted via data flow by that object
- Full Data Lineage - Both of the above.