Setting system indicators

This column analysis uses out-of-box indicators to provide simple statistics such as row, blank and duplicate counts on the Email and Phone columns.

Before you begin

  • You have opened the Profiling perspective in Talend Studio.

  • You have created a column analysis and defined the connection to the database.

Procedure

  1. In the Data Preview section in the analysis editor, click Select indicators to open the Indicator Selection dialog box.
  2. Expand Simple Statistics and select Row Count, Blank Count and Duplicate Count. Click OK to close the dialog box.

    The row, blank and duplicate counts for the Email and Phone columns show how consistent the data is.

    The selected indicators are added to the corresponding columns in the Analyzed Columns section.

  3. Click the Options icon next to the Duplicate Count and Blank Count indicators and set the Upper threshold field to 0.

    With these thresholds defined, the counts of duplicate and blank values in the Email and Phone columns are written in red in the analysis results when they exceed 0.
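
To make the effect of these settings concrete, the following Python sketch mimics the three counts and the upper-threshold check on a small in-memory column. It is an illustration only, not how Talend Studio computes the indicators. The function names `simple_statistics` and `exceeds_upper_threshold` and the sample data are hypothetical. The sketch also assumes that a blank is a non-null string that is empty or contains only white space, and that the duplicate count is the number of distinct values occurring more than once.

```python
from collections import Counter


def simple_statistics(values):
    """Return row, blank and duplicate counts for one column of strings (None = null)."""
    non_null = [v for v in values if v is not None]
    occurrences = Counter(non_null)
    return {
        # Every row is counted, including nulls and blanks.
        "row_count": len(values),
        # Assumption: a blank is a non-null string that is empty or whitespace only.
        "blank_count": sum(1 for v in non_null if v.strip() == ""),
        # Assumption: number of distinct values that occur more than once.
        "duplicate_count": sum(1 for n in occurrences.values() if n > 1),
    }


def exceeds_upper_threshold(count, upper=0):
    """True when the count is above the upper threshold (the case written in red)."""
    return count > upper


# Hypothetical sample data standing in for the Email column.
emails = ["a@example.com", "b@example.com", "a@example.com", "  ", "", None]

stats = simple_statistics(emails)
flagged = {
    name: exceeds_upper_threshold(stats[name])
    for name in ("blank_count", "duplicate_count")
}
print(stats)
print(flagged)
```

With an upper threshold of 0, a single blank or duplicate value is enough to flag the indicator, which is why this setting suits columns such as Email and Phone where no blanks or duplicates are expected.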
