Sharing analysis results: reports

After profiling the email and ZIP code columns and obtaining detailed results about the structure and consistency of the address data, you need to share these results with other business users.

You must first generate a report file on the analysis results from Talend Studio and save the report in a data quality data mart.

Procedure

  1. In the DQ Repository tree view, right-click the analysis name and select New Report.

    The report editor is displayed with the selected analysis listed in the Analysis List.

    Analysis list section
  2. In the Analysis list view, from the Template type list, select Evolution as the type of report you want to generate.
    In this example, you generate an evolution report that shows how the indicators used on the email and postal columns change over time. The report lets you compare current and historical statistics to determine whether the address data has improved or degraded. This information helps you decide when to intervene and fix the data, and so monitor data quality on an ongoing basis.
  3. Select the Refresh All check box to refresh the listed analysis before generating the report.
  4. In the Generated Report Settings view, from the File Type list, select PDF as the format of the report file.
  5. In the Database Connection Settings view, set the connection parameters to the data mart where you want to store the report results.
    Database Connection Settings section
  6. Click the Check button to verify that your connection is successful.
    A message confirms whether the database exists and whether the connection is successful.
  7. If the database structure does not exist, click OK in the message to let Talend Studio create it for you.
  8. Click OK to close the confirmation message.
  9. Save the report and click the Run report icon on the editor toolbar to generate the report file.

Results

A report file is generated and listed under the Reports node in the DQ Repository tree view. The report shows the evolution through time of the simple statistics indicators and the patterns used on the email and postal columns.

Below are the results of the email column:

Chart results for the email column.

This chart shows that 89.80% of the email addresses are currently valid.
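To make this kind of figure concrete: a pattern-based percentage is the share of values that match the pattern. The sketch below only illustrates that arithmetic and is not how Talend Studio computes its indicators. The regular expression EMAIL_PATTERN, the function valid_percentage, and the sample values are all hypothetical.

```python
import re

# Hypothetical, deliberately simplified email pattern (an assumption for
# illustration; it is not the pattern shipped with Talend Studio).
EMAIL_PATTERN = re.compile(r"^[\w.+-]+@[\w-]+(\.[\w-]+)+$")


def valid_percentage(values):
    """Return the percentage of entries in values that match EMAIL_PATTERN."""
    if not values:
        return 0.0
    matches = sum(1 for v in values if EMAIL_PATTERN.match(v))
    return 100.0 * matches / len(values)


# Made-up sample data: four well-formed addresses and one malformed one.
sample = [
    "ann@example.com",
    "bob.smith@example.org",
    "carol+news@example.co.uk",
    "dave@example.net",
    "not-an-email",
]
print(f"{valid_percentage(sample):.2f}% valid")
```

With this made-up sample, four of the five values match, so the script reports 80.00% valid.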

Charts for the simple statistics indicators.

For the simple statistics indicators, there are two charts: the first indicates the change in the statistics and the second indicates the percentage of that change.

If you generate this report repeatedly, the line stays flat as long as the data does not change. The line goes up when data is fixed and down when data becomes less accurate and consistent.
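The relationship between the two charts and the direction of the line can be sketched as simple arithmetic over successive report runs. This is a minimal illustration under assumptions, not Talend Studio's implementation: the functions changes and trend are hypothetical names, and the history values are made up.

```python
def changes(history):
    """Return (absolute change, percentage change) between consecutive runs."""
    result = []
    for previous, current in zip(history, history[1:]):
        delta = current - previous
        # Percentage change is undefined when the previous value is zero.
        percent = None if previous == 0 else 100.0 * delta / previous
        result.append((delta, percent))
    return result


def trend(delta):
    """Describe the direction of the line for one change value."""
    if delta > 0:
        return "up"
    if delta < 0:
        return "down"
    return "flat"


# Made-up counts of valid email rows from four hypothetical report runs.
valid_rows = [800, 800, 900, 810]
for delta, percent in changes(valid_rows):
    print(trend(delta), delta, percent)
```

For this made-up history, the first pair of runs gives a flat segment (no change), the second an upward segment (+100 rows, +12.5%), and the third a downward segment (-90 rows, -10.0%).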

For more information on reports, see Reports in the Talend Studio User Guide.
