tFileInputRegex
Reads a file row by row, splits each row into fields using regular expressions, and sends the fields, as defined in the schema, to the next component.
This is a powerful component that can replace a number of other components of the File family. It requires advanced knowledge of regular expression syntax.
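The row-by-row behavior described above can be sketched outside Talend as follows. This is a minimal Python illustration of the idea, not the code Talend generates. The schema column names, the log-line pattern, and the `read_rows` helper are all hypothetical, and skipping non-matching rows is an assumption made for this sketch rather than documented component behavior.

```python
import re
from typing import Dict, Iterable, Iterator

# Hypothetical schema: one column name per capture group, in order.
SCHEMA = ["date", "level", "message"]

# Hypothetical pattern for rows such as "2024-01-15 [INFO] Job started".
# Each parenthesized group captures the value of one schema column.
ROW_PATTERN = re.compile(r"^(\d{4}-\d{2}-\d{2}) \[(\w+)\] (.*)$")


def read_rows(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict per matching row, keyed by the schema column names."""
    for line in lines:
        match = ROW_PATTERN.match(line.rstrip("\n"))
        if match is None:
            # Assumption for this sketch: rows that do not match are skipped.
            continue
        yield dict(zip(SCHEMA, match.groups()))


if __name__ == "__main__":
    sample = [
        "2024-01-15 [INFO] Job started\n",
        "not a log line\n",
        "2024-01-16 [ERROR] Disk full\n",
    ]
    for row in read_rows(sample):
        print(row)
```

Because `read_rows` accepts any iterable of strings, an open file object can be passed in place of the sample list. The number of capture groups in the pattern should equal the number of schema columns, since `zip` pairs them by position.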
For more technologies supported by Talend, see Talend components.
Depending on the Talend product you are using, this component can be used in one, some or all of the following Job frameworks:
- Standard: see tFileInputRegex Standard properties.
  The component in this framework is available in all Talend products.
- MapReduce: see tFileInputRegex MapReduce properties (deprecated).
  The component in this framework is available in all Talend products with Big Data and Talend Data Fabric.
- Spark Batch: see tFileInputRegex properties for Apache Spark Batch.
  The component in this framework is available in all Talend products with Big Data and Talend Data Fabric.
- Spark Streaming: see tFileInputRegex properties for Apache Spark Streaming.
  This component is available in Talend Real Time Big Data Platform and Talend Data Fabric.