Validating data against schema
This scenario presents a Job that checks the type, nullability, and length of data from an incoming flow against a defined reference schema, and displays the validation results on the Run console.
The incoming flow comes from a simple CSV file that contains heterogeneous
data including wrong data type, data exceeding the maximum length, wrong ID and null values
in non-nullable columns, as shown below:
ID;Name;BirthDate;State;City
1;Dwight;06-04-2008;Delaware;Concord
2;Warren;25-10-2008;Montana
3;Benjamin;17-08-2008;Washington;Austin
4;Harry;14-04-2008;Kansas;Annapolis
5;Ulysses;2007-04-12;Michigan;Raleigh
6;James;19-08-2007;Delaware;Charleston
.7;Bill;20-04-2007;Illinois;Bismarck
8;Ulysses;04-12-2008;;Saint Paul
9;Thomas;09-05-2008;Maryland;Albany
10;Ronald;11-02-2008;Florida;Hartford
This scenario applies only to Talend Data Management Platform, Talend Big Data Platform, Talend Real-Time Big Data Platform, Talend MDM Platform, Talend Data Services Platform, Talend MDM Platform and Talend Data Fabric.
Setting up the Job
Procedure
Configuring the components
Procedure
Executing the Job
Procedure
Save your Job and press F6 to launch it.
Two tables are displayed on the console, showing the valid data and
rejected data respectively.