Selecting the set of columns you want to analyze in the delimited file
Big Data Platform
Cloud API Services Platform
Cloud Big Data Platform
Cloud Data Fabric
Cloud Data Management Platform
Data Fabric
Data Management Platform
Data Services Platform
MDM Platform
Real-Time Big Data Platform
Procedure
Expand the FileDelimited connection and
browse to the set of columns you want to analyze.
Select the columns to be analyzed, and then click Finish to close this New
analysis wizard.
The analysis editor opens with the defined analysis
metadata, and a folder for the newly created analysis is displayed under
Analyses in the DQ Repository tree view.
A sample data is displayed in the Data Preview section and the selected columns are displayed
in the Analyzed Column section of the
analysis editor.
If required, select another connection from the Connection
box in the Analyzed Columns view. This box lists all the
connections created in Talend Studio
with the corresponding database names.
By default, the delimited file connection you have selected in the previous step
is displayed in the Connection box.
If required, click the Select columns to
analyze link to open a dialog box where you can modify your
column selection.
Information noteNote: You can filter the table or column lists by typing the desired text in the
Table filter or Column
filter fields respectively. The lists will show only the
tables/columns that correspond to the text you type in.
In the column list, select the check boxes of the columns you want to analyze and
click OK to proceed to the next step.
In this example, you want to analyze a set of six columns in the delimited file:
account number (account_num), education (education), email (email), first name
(fname), last name (lname) and gender (gender). You want to identify the number of
rows, the number of distinct and unique values and the number of duplicates.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!