
Publishing datasets and lineage to Qlik Cloud

Data Integration Jobs published from Talend Studio to Talend Cloud can generate input and output datasets and lineage, which can be sent to Qlik Cloud. On the Catalog page of Qlik Cloud, you can then view the lineage of your data, which traces the transformations back to the original source.

Before you begin

  • You have a Qlik Talend Cloud Premium Edition or Qlik Talend Cloud Enterprise Edition license.
  • You have already configured the connection and authentication from Talend Management Console to Qlik Cloud. For more information, see Configuring Talend Cloud with Qlik Cloud.
  • The Job to publish reads data from a data source with an input component, transforms the data, and then maps the data to pass it to an output component.

Procedure

  1. Publish your Job to Talend Cloud. For more information, see Publishing to Talend Cloud.

    Example

    Data Integration Job in the design workspace.
  2. Edit your Job task in Talend Management Console to enable the lineage collection option. For more information, see Enabling lineage collection for Job tasks.
  3. Run the Job task in Talend Management Console. For more information, see Executing Job tasks manually.
    The Job artifact generates datasets and lineage and sends them to Qlik Cloud.
    Note: The lineage is generated at run time when the task is executed for the first time, or after the task is modified.

What to do next

You can now open the Catalog page in Qlik Cloud to view the datasets. If you select Lineage for a dataset, you can view the lineage graph of your data, showing the transformations backwards to the original source. For more information, see Browsing datasets from the Catalog and Analyzing lineage in Analytics.

Data lineage in Qlik Cloud.

In this example, the data lineage graph shows the data transformation from the input datasets STUDENT_INFO_G4 and TEACHER_INFO to the output G4_ALL. The input datasets are read by the tSnowflakeInput components, transformed by the Data Integration Job DemoLineage_Step1 and then written by the tSnowflakeOutput component into the Snowflake table.
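To make the idea of tracing lineage backwards more concrete, the example graph can be modeled as a small directed graph and walked from the output to its original sources. This is only an illustrative sketch: the node names come from the example above, but the edge list, the function names build_upstream_index and original_sources, and the traversal itself are assumptions made for illustration. They do not reflect how Qlik Cloud stores or queries lineage, and no Qlik or Talend API is used.

```python
from collections import defaultdict

# Hand-written model of the example graph. Each edge points downstream:
# (producer, consumer). This is NOT data exported from Qlik Cloud.
EDGES = [
    ("STUDENT_INFO_G4", "DemoLineage_Step1"),
    ("TEACHER_INFO", "DemoLineage_Step1"),
    ("DemoLineage_Step1", "G4_ALL"),
]


def build_upstream_index(edges):
    """Map each node to the set of nodes that feed directly into it."""
    upstream = defaultdict(set)
    for producer, consumer in edges:
        upstream[consumer].add(producer)
    return upstream


def original_sources(node, upstream):
    """Walk backwards from `node` and return the nodes that have no upstream.

    The starting node is never reported as its own source.
    """
    seen = set()
    stack = [node]
    sources = set()
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        parents = upstream.get(current, set())
        if not parents and current != node:
            sources.add(current)
        stack.extend(parents)
    return sources


upstream = build_upstream_index(EDGES)
print(sorted(original_sources("G4_ALL", upstream)))
# prints ['STUDENT_INFO_G4', 'TEACHER_INFO']
```

Starting from G4_ALL, the walk passes through the Job DemoLineage_Step1 and stops at the two input datasets, which mirrors what the lineage graph shows when read from right to left.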

Note: A Job artifact can generate datasets and lineage for input and output connectors for files and databases. However, because Talend Studio supports a different list of connectors than Qlik Cloud, Qlik Cloud support is limited for some operations, such as Profiling and Preview. For more information, see Understanding your data with the catalog in Data Integration.
