tHDFSInput
Extracts the data in a HDFS file for other components to process it.
tHDFSInput reads a file located on a given Hadoop distributed file system (HDFS) and puts the data of interest from this file into a Talend schema. Then it passes the data to the component that follows.
For more technologies supported by Talend, see Talend components.
Depending on the Talend product you are using, this component can be used in one, some or all of the following Job frameworks:
-
Standard: see tHDFSInput Standard properties.
The component in this framework is available in all Talend products with Big Data and in Talend Data Fabric.
-
MapReduce: see tHDFSInput MapReduce properties (deprecated).
The component in this framework is available in all Talend products with Big Data and Talend Data Fabric.