tFileOutputParquet
Receives records from the processing component placed ahead of it and writes the
records into Parquet format files in a given distributed file system.
Depending on the Talend product you are using, this component can be used in one, some or all of the following Job frameworks:
-
MapReduce: see tFileOutputParquet MapReduce properties.
The component in this framework is available in all subscription-based Talend products with Big Data and Talend Data Fabric.
-
Spark Batch: see tFileOutputParquet properties for Apache Spark Batch.
The component in this framework is available in all subscription-based Talend products with Big Data and Talend Data Fabric.
-
Spark Streaming: see tFileOutputParquet properties for Apache Spark Streaming.
This component is available in Talend Real Time Big Data Platform and Talend Data Fabric.