Skip to main content Skip to complementary content

What's new in R2022-07

Big Data: new features

Feature

Description

Available in

Support of Databricks runtime 10.x and onwards on Spark Universal 3.2.x You can now run your Spark Batch and Streaming Jobs on job and all-purpose Databricks clusters, both on AWS and Azure, using Spark Universal with Spark 3.2.x. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select this mode, Talend Studio is compatible with Databricks 10.x version.

All subscription-based Talend products with Big Data

Support of simple mapping with Dataset when using tMap in Spark Batch Jobs Following the migration of component APIs from Resilient Distributed Dataset (RDD) to Dataset (DS), you can now perform simple mapping with DS as well as RDD when you use tMap in your Spark Batch Jobs.
For example, the following screenshot shows a simple mapping without any transformation from the input table to the output table:

All subscription-based Talend products with Big Data

Support of IDBroker with Cloudera CDP Public Cloud Talend Studio now supports IDBroker as part of Knox authentication service with Cloudera CDP Public Cloud. When you use S3 in your Spark Batch Jobs, IDBroker allows to read and write any type of files using S3 as an external storage and provides the token generation.

All subscription-based Talend products with Big Data

Support of Azure Active Directory (AD) Service Principal for Azure Synapse Spark pools in Spark Jobs

When you use Azure Synapse Analytics with Apache Spark pools as a distribution for your Spark Batch and Spark Streaming Jobs, you can now authenticate with Azure AD Service Principal. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard.

When you select the Azure Active Directory authentication mode, you can also authenticate using a certificate now.

All subscription-based Talend products with Big Data

Data Integration: new features

Feature

Description

Available in

Enhancement of data type mapping to modify, restore and migrate the default mapping files The data type mapping in Talend Studio has been improved.
  • The default data type mapping files can now only be modified. After modification, you can restore them to the Talend default data type mapping files corresponding to your current Talend Studio version.
  • The default data type mapping files are now migrated automatically after applying a Talend Studio monthly update.

For more information, see Type mapping.

All subscription-based Talend products with Talend Studio

Support of deleting the .m2 folder after updating Talend Studio You can now delete the .m2 folder under <Talend-Studio>\configuration\ after applying a Talend Studio monthly update. You must close Talend Studio before deleting the .m2 folder. This helps you to easily remove the obsolete JAR files affected by Common Vulnerabilities and Exposures (CVEs).

For more information, see Updating Talend Studio.

All subscription-based Talend products with Talend Studio

MIMB OEM cumulative patch upgrade for Talend Metadata Bridge The MIMB OEM cumulative patch for Talend Metadata Bridge has been upgraded from MIMB-OEM-CumulativePatch-1010-20220208-1632.zip to MIMB-OEM-CumulativePatch-1010-20220622-1543.zip.

All subscription-based Talend products with Talend Studio

Improvements of FTP components: JAR upgrade, new JSch client options, and enhanced logging JSch JAR is upgraded to version 0.2.1. A new option, Config client, is provided by FTP components, allowing you to set JSch client options.

All subscription-based Talend products with Talend Studio

Enhancement of tMongoDBOutput and tCosmosDBOutput to support dynamic schema

The tMongoDBOutput and tCosmosDBOutput components support dynamic schema when inserting documents to collections.

All subscription-based Talend products with Big Data

Enhancement of the Snowflake metadata wizard This feature enables the Snowflake metadata wizard to retrieve new schemas without affecting existing ones.

All subscription-based Talend products with Talend Studio

Data Quality: new features

Feature

Description

Available in

tDataQualityRules for Spark The component is now available for Apache Spark Batch and Streaming.
It supports:
  • Apache Spark 3.0 and greater in local mode.
  • Cloudera Data Engineering service with Apache Spark 3.1 and 3.2.

All Talend Platform and Data Fabric products

Viewing rows of match analysis The View rows feature is now available for match analyses from the Duplicate Record Statistics or Group Statistics tabs.

It lets you view all the analyzed data corresponding to the row you selected. You can export those data into a CSV file.

All Talend Platform and Data Fabric products

Continuous Integration: new features

Feature

Description

Available in

Talend CI Builder upgraded to version 8.0.7 Talend CI Builder is upgraded from version 8.0.5 to version 8.0.7.

Use Talend CI Builder 8.0.7 in your CI commands or pipeline scripts from this monthly version onwards until a new version of Talend CI Builder is released.

All subscription-based Talend products with Talend Studio

Support of displaying project details at build time

Talend CI Builder now supports displaying project details at build time.

All subscription-based Talend products with Talend Studio

Compatibility check between Talend CommandLine and project versions at build time

Talend CI Builder now supports checking the compatibility between Talend CommandLine and project versions at build time. If the Talend CommandLine version is older than your project version, the build fails and an error message is displayed.

All subscription-based Talend products with Talend Studio

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!