Drop the following components from the Palette onto the
design workspace: tFileInputDelimited,
tMatchPairing, tLogRow, and two
tFileOutputDelimited components.
Connect tFileInputDelimited to
tMatchPairing using the Main link.
tFileInputDelimited reads the source file and sends
data to the next component.
Connect tMatchPairing to the output file components
using the Pairs and Unique rows
links, and to tLogRow using the Exact
duplicates link.
tMatchPairing pre-analyzes the data, computes pairs
of suspect duplicates, unique rows, and exact duplicates, and generates a pairing
model to be used with tMatchPredict.
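To make the three outputs concrete, here is a minimal Python sketch of the idea behind the Pairs, Unique rows, and Exact duplicates flows. It is not how tMatchPairing is implemented: the function name split_rows, the similarity measure (difflib.SequenceMatcher from the Python standard library), the 0.8 threshold, and the sample rows are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def split_rows(rows, threshold=0.8):
    """Split rows into (suspect_pairs, unique_rows, exact_duplicates).

    Illustrative only: the similarity measure and the threshold are
    assumptions, not the algorithm used by tMatchPairing.
    """
    seen = set()
    distinct, exact_duplicates = [], []
    for row in rows:
        if row in seen:
            # Row is identical to one already read: an exact duplicate.
            exact_duplicates.append(row)
        else:
            seen.add(row)
            distinct.append(row)

    suspect_pairs = []
    paired = set()
    # Compare every pair of distinct rows; similar ones are suspect duplicates.
    for a, b in combinations(distinct, 2):
        score = SequenceMatcher(None, " ".join(a), " ".join(b)).ratio()
        if score >= threshold:
            suspect_pairs.append((a, b))
            paired.update((a, b))

    # Rows that take part in no suspect pair are treated as unique.
    unique_rows = [row for row in distinct if row not in paired]
    return suspect_pairs, unique_rows, exact_duplicates


# Hypothetical sample data standing in for the delimited source file.
rows = [
    ("John Smith", "Paris"),
    ("Jon Smith", "Paris"),
    ("John Smith", "Paris"),
    ("Maria Garcia", "Madrid"),
]
pairs, unique, exact = split_rows(rows)
```

In this sketch, pairs corresponds to what the Pairs link would carry, unique to the Unique rows link, and exact to the Exact duplicates link. The all-pairs comparison is quadratic and only suitable for a small illustration.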