Profiling an HDFS file
From the
Profiling
perspective of Talend Studio, you can generate a column analysis with simple statistics indicators on an HDFS file
via a Hive connection.
Procedure
The sequence to create a profiling analysis on an HDFS file involves the following steps:
What to do next
You can then modify the analysis settings and add other indicators as needed. You can also create other analyses later on this HDFS file by using the same Hive table.