In this scenario, the main input schema is already stored in the Repository. For more information about storing schema
metadata in the repository, see the Talend Studio User
Guide.
Procedure
In the Repository tree view, expand
Metadata - DB
Connections where you have stored the main input schema and
drop the database table onto the design workspace. The input table used in
this scenario is called customer.
A dialog box is displayed with a list of components.
Select the relevant database component, tMysqlInput in this example, and then click OK.
Drop two tGenKey components, two
tMatchGroup components, a tMap and a tLogRow components from Palette onto the design workspace.
Link the input component to the tGenKey
and tMap components using Main links.
In the two tMatchGroup components, select the
Output distance details check boxes in the
Advanced settings view of both components
before linking them together.
This will provide the MATCHING_DISTANCES column in the
output schema of each tMatchGroup.
If the two tMatchGroup components are already
linked to each other, you must select the Output distance
details check box in the second component in the Job flow first
otherwise you may have an issue.
Link the two tMatchGroup components and
the tLogRow component using Main links.
If needed, give the components specific labels to reflect their usage in
the Job.
For further information about how to label a component, see Talend StudioUser Guide.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!