Big Data
Feature |
Description |
Available in |
---|---|---|
Support for Iceberg table format in tHiveCreateTable in Standard Jobs | You can now create an Iceberg table with tHiveCreateTable in Standard Jobs
using either Cloudera or Amazon EMR distributions. Using Iceberg tables allows you to work with different file formats such as Parquet, ORC, and Avro with Cloudera distributions, and Parquet only with Amazon EMR distributions. |
All subscription-based Talend products with Big Data |
New tHBaseNamespace component to create namespaces for HBase tables in Standard Jobs | A new component, tHBaseNamespace, is available in Talend Studio in your Standard Jobs. This component allows you to create a namespace for HBase tables. |
All subscription-based Talend products with Big Data |
Support for HDInsight 5.0 with Spark Universal 3.1.x | You can now run your Spark Batch and Spark Streaming Jobs on HDInsight with
Spark Universal 3.1.x. You can configure it either in the Spark
Configuration view of your Spark Jobs or in the
Hadoop Cluster Connection metadata wizard, with
either ADLS Gen2 storage or Azure storage. When you select this mode, Talend Studio is compatible with HDInsight 5.0 version. You must deactivate Log4j in components to be able to run your Spark Jobs on HDInsight. To do so, go to Activate log4j in components check box. and clear the |
All subscription-based Talend products with Big Data |
Support for AWS EMR Serverless 6.6.0 with Spark Universal 3.2.x and 3.3.x | You can now run your Spark Batch Jobs on AWS EMR Serverless with Spark
Universal 3.2.x and 3.3.x. You can configure it in the Spark
Configuration view of your Spark Batch Jobs. When you select this mode, Talend Studio is compatible with AWS EMR Serverless 6.6.0 version. |
All subscription-based Talend products with Big Data |