Big Data
Feature | Description | Available in |
---|---|---|
Support for branching and tagging with Iceberg components in Standard Jobs | You can now perform actions on branches and tags in your Iceberg table with the tIcebergTable in Standard Jobs. New parameters are now available in the Alter table action drop-down list, allowing you to either create or delete branches and tags. |
All subscription-based Talend products with Big Data |
Support for parallelization during output files writing in Spark Jobs | A new option, Parallelize output files writing, is
available in the Spark Configuration view of your Spark
Batch Jobs. When you select this option, it allows the Spark Batch Jobs to run
multiple threads in parallel when writing output files rather than writing
output files sequentially in one thread. This option improves the performance of the execution time. This feature is available for all
distributions, but is only available for Spark Batch Jobs containing the
following output components:
|
All subscription-based Talend products with Big Data |
Support for HDInsight connection mode with Hive components in Standard Jobs | HDInsight 5.0 and 5.1 versions are now supported in Hive components with ADLS Gen1 in Standard Jobs. |
All subscription-based Talend products with Big Data |
Support for HDInsight 5.1 with Spark Universal 3.3.x | You can now run your Spark Batch and Spark Streaming Jobs on HDInsight with
Spark Universal 3.3.x. You can configure it either in the Spark
Configuration view of your Spark Jobs or in the
Hadoop Cluster Connection metadata wizard, with
either ADLS Gen2 storage or Azure storage. When you select this mode, Talend Studio is compatible with HDInsight 5.1 version. |
All subscription-based Talend products with Big Data |