Extracting distinct values
About this task
From the Profiling perspective, you can create a column analysis to compute the number of most frequent values for each distinct record in a column. After executing the column analysis, you can generate a ready-to-use Job that will extract in an output file the distinct values from a value frequency.
You can then use these distinct values as a reference dataset for other data standardization processes.
In the example below a column analysis on a postal_code column in a MySQL database has been created and executed in the Profiling perspective.
Prerequisites: You have already created and executed a column analysis that uses the Value Frequency indicator.
To generate a Job that extracts distinct values from a value frequency, do the following