Setting up the Job
Procedure
- Drop the following components from the Palette onto the design workspace: tFileInputDelimited, tExtractDynamicFields, tUniqRow, tFileOutputDelimited, and tLogRow. Label the components as shown above so that their roles in the Job are easy to identify.
- Connect the components labelled People, Split_Column, and Deduplicate in sequence using Row > Main connections.
- Connect the component labelled Deduplicate to the component labelled Unique_Families using a Main > Uniques connection.
- Connect the component labelled Deduplicate to the component labelled Duplicated_Families using a Main > Duplicates connection.
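
To make the two outputs of the Deduplicate step easier to picture, here is a minimal Python sketch of the routing that the Uniques and Duplicates connections describe. It is an illustration only, not code generated by the Studio. It assumes that the first row seen for a given key goes to the Uniques flow and that every later row with the same key goes to the Duplicates flow. The function name, the sample rows, and the "family" key column are all hypothetical, and the Split_Column step is not modelled.

```python
from typing import Callable, Dict, Hashable, Iterable, List, Tuple

Row = Dict[str, str]


def split_uniques_duplicates(
    rows: Iterable[Row], key: Callable[[Row], Hashable]
) -> Tuple[List[Row], List[Row]]:
    """Route each row to 'uniques' or 'duplicates' based on its key.

    Assumption: the first row with a given key is treated as unique,
    and any later row with the same key is treated as a duplicate.
    """
    seen = set()
    uniques: List[Row] = []
    duplicates: List[Row] = []
    for row in rows:
        k = key(row)
        if k in seen:
            duplicates.append(row)  # key already encountered
        else:
            seen.add(k)
            uniques.append(row)  # first occurrence of this key
    return uniques, duplicates


# Hypothetical sample data standing in for the People input flow.
people = [
    {"first_name": "Ann", "family": "Smith"},
    {"first_name": "Bob", "family": "Jones"},
    {"first_name": "Cid", "family": "Smith"},
]

# Hypothetical key column; the real key is whatever is configured
# in the Deduplicate component.
unique_families, duplicated_families = split_uniques_duplicates(
    people, key=lambda r: r["family"]
)
```

In this sketch, unique_families corresponds to the rows that would reach Unique_Families, and duplicated_families to the rows that would reach Duplicated_Families.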