Executing the Job
Procedure
Save your Job and press F6 to execute it.
The output columns include the
T_GEN_KEY column that holds
the functional key generated by the tGenKey component.
You can see that all records that have the same
functional key are grouped together in different
blocks "groups". The identifier for each group is
listed in the GID column next
to the corresponding record. The number of records
in each of the output blocks is listed in the
GRP_SIZE column and computed
only on the master record. The
MASTER column indicates with
true/false if the corresponding record is a master
record or not a master record. The
SCORE column lists the
calculated distance between the input record and the
master record according to the
Jaro-Winkler matching
algorithm.
For an example of creating data partitions based on different
blocking keys and using them with multiple tMatchGroup components, see tMatchGroup.