Diagram Visualization Common Features
There are a number of common features and tools available when visualizing a lineage trace, data model, etc.
Overview
You may click this Show overview icon to show or hide an Overview panel of the model diagram. Click in the overview to quickly move to a portion of the full diagram.
Zoom In/Out and Fit to content
Click Zoom in () or Zoom out () icons to adjust the aspect ratio of the diagram. Also, you may click on the Fit to content ()icon to view the entire diagram at the best zoom that will fit.
Collapse / Expand
Click Expand / Collapse to expand or collapse the entire diagram (ensure that you do not have an object selected, otherwise the action will only apply to that object).
You may also click on the plus sign for an object to expand and the minus sign to collapse just that object.
Data Lineage Diagram Display Options
You may control the display of lineage objects and their presentation using the lineage Display Options menu.
Here you may see the terms with Defined by relationships.
Show/Hide Columns
Click on the Display Options icon and click Show Conditional Labels
Here, you may pick and choose conditional labels to show in the diagram and the image shows all of them selected for display.
Show Mixed Connections
See Show Mixed Connections in the Classic Data Lineage Diagram.
Maximum Node Width
See Maximum Note Width in the Data Lineage Diagram.
Lineage Diagram Trace in General
Select the Analysis Diagram tab on the left to obtain this presentation. You will see a graphical presentation of the lineage (data impact or data source).
Additional options include:
Overview
You may click this icon to show or hide an Overview panel of the lineage trace diagram. Click in the overview to quickly move to a portion of the full diagram.
Zoom In/Out and Fit to content
Click Zoom in or Zoom out icons to adjust the aspect ratio of the diagram. Also, you may click on the Fit to content icon to view the entire diagram at the best zoom that will fit.
Collapse / Expand
Click Expand / Collapse to expand or collapse the entire diagram (ensure that you do not have an object selected, otherwise the action will only apply to that object).
You may also click on the plus sign for an object to expand and the minus sign to collapse just that object.
Open the object page
You may right-click and select Open (),to navigate to the object page.
You may download a PNG or SVG image of the diagram.
Quick find
In the upper right, there is a search text box that will provide a quick list of object names that contain the text you type. You may click on any of the results to select that object in the diagram and moving the focus there.
Explore Further
Invoking a lineage trace from any reference to a object
You may invoke a lineage trace from any diagram or any list of results (e.g., from a Browse or Search), either via right-click context menu
Interpreting the graphical lineage
In general, the lineage tools within Talend Data Catalog function identically whether one is analyzing data flow lineage, semantic lineage or both. However, the presentation is different, as follows:
In addition, Talend Data Catalog has four levels of presentation:
- Configuration Model Connections Overview – which is a diagram representing the various Models contained within a configuration and how they are related (or stitched) to each other based upon connection definitions manually assigned to Talend Data Catalog .
- Model Connections Overview – which is a diagram representing the various Models contained within the directory of an external repository and how they are related (or stitched) to each other based upon connection definitions already provided in the external metadata repository.
- Model Lineage Overview – which is a diagram representing an overview of the lineage within a given Model.
- Lineage Trace analysis at the configuration or Model level – which is a fully detailed trace of semantic and/or data flow lineage for detailed analysis.
Properties Panel
Click to select a object and view its properties in the Properties Panel on the right. You may show and hide this panel as needed.
Display Options
Data Flow Settings
Many options are available in the menus of a data flow lineage report.
One may include or filter out various object types in order to focus only on specific types of objects in the lineage.
Click Edit Filters and specify:
- SHOW TEMPORARY OBJECTS to show intermediate temporary tables/columns in the lineage
- SHOW INTERNAL OBJECTS to show any intermediate schemas/tables/columns between connections in the lineage
- SHOW EXTERNAL OBJECTS to show any external source tables or files which an object in the lineage from which the object is derived
- SHOW EXTERNAL TABLE LOCATION OBJECTS to include objects which are only external table locations that require connection resolution.
- EXCLUDE MODEL TYPES to not show specific types of models in the lineage
- EXCLUDE MODELS to not show specifically selected models.
In some cases you may see that a lineage diagram is taking an excessive amount of time to display or that you are presented with the message:
This large diagram has xxxxx objects and xxxxx links which may require more resources that what your browse case handle.
You may use the PROCEED ANYWAY button to try to visualize the diagram.
You may also save these settings as defaults in future lineage traces.
Steps
- Begin a lineage trace.
- Click Edit Filters and specify:
- SHOW TEMPORARY OBJECTS to show intermediate temporary tables/columns in the lineage
- SHOW INTERNAL OBJECTS to show any intermediate schemas/tables/columns between connections in the lineage
- SHOW EXTERNAL OBJECTS to show any external source tables or files which an object in the lineage from which the object is derived
- SHOW EXTERNAL TABLE LOCATION OBJECTS to include objects which are only external table locations that require connection resolution.
- EXCLUDE MODEL TYPES to not show specific types of models in the lineage
- EXCLUDE MODELS to not show specifically selected models.
Show Internal/External Objects
Lineage reporting may
- either Show Internal Objects within a model (e.g., interim steps in transformations) or just the objects stitched to other model objects.
- either Show External Objects that are not directly material to the lineage trace (such as the link from files in HDSF to the tables representing them in Hive) or not show these objects.
Show Temporary Objects
Big data solutions and other ETL/DI processes use temporary files and tables routinely. When harvesting, Talend Data Catalog detects temporary files and marks them as TEMPORARY in their lineage characteristics. This fact means that you can distinguish temporary objects from permanent/stitchable ones in a lineage diagram and, optionally hide/show them.
Show External Table Location Objects
Models may refer to external tables that require connection resolution. By default, these table location objects are not shown. You may use this option to explicitly show them.
Default View
This option allows you to save the current filter setting to be the default for future trace reports.