What's new in R2022-07
Big Data: new features
Feature |
Description |
Available in |
---|---|---|
Support of Databricks runtime 10.x and onwards on Spark Universal 3.2.x | You can now run your Spark Batch and Streaming Jobs on job and all-purpose
Databricks clusters, both on AWS and Azure, using Spark Universal with Spark
3.2.x. You can configure it either in the Spark
Configuration view of your Spark Jobs or in the Hadoop
Cluster Connection metadata wizard. When you select this mode, Talend Studio is compatible with Databricks 10.x version. |
All subscription-based Talend products with Big Data |
Support of simple mapping with Dataset when using tMap in Spark Batch Jobs | Following the migration of component APIs from Resilient Distributed Dataset
(RDD) to Dataset (DS), you can now perform simple mapping with DS as well as RDD
when you use tMap in your Spark Batch Jobs. For example, the following
screenshot shows a simple mapping without any transformation from the input
table to the output table:
|
All subscription-based Talend products with Big Data |
Support of IDBroker with Cloudera CDP Public Cloud | Talend Studio now supports IDBroker as part of Knox authentication service with Cloudera CDP Public Cloud. When you use S3 in your Spark Batch Jobs, IDBroker allows to read and write any type of files using S3 as an external storage and provides the token generation. |
All subscription-based Talend products with Big Data |
Support of Azure Active Directory (AD) Service Principal for Azure Synapse Spark pools in Spark Jobs |
When you use Azure Synapse Analytics with Apache Spark pools as a distribution for your Spark Batch and Spark Streaming Jobs, you can now authenticate with Azure AD Service Principal. You can configure it either in the Spark Configuration view of your Spark Jobs or in the Hadoop Cluster Connection metadata wizard. When you select the Azure Active Directory authentication mode, you can also authenticate using a certificate now. |
All subscription-based Talend products with Big Data |
Data Integration: new features
Feature |
Description |
Available in |
---|---|---|
Enhancement of data type mapping to modify, restore and migrate the default mapping files | The data type mapping in Talend Studio
has been improved.
For more information, see Type mapping. |
All subscription-based Talend products with Talend Studio |
Support of deleting the .m2 folder after updating Talend Studio | You can now delete the .m2 folder under
<Talend-Studio>\configuration\ after applying a
Talend Studio
monthly update. You must close Talend Studio
before deleting the .m2 folder. This helps you to easily
remove the obsolete JAR files affected by Common Vulnerabilities and Exposures
(CVEs). For more information, see Updating Talend Studio. |
All subscription-based Talend products with Talend Studio |
MIMB OEM cumulative patch upgrade for Talend Metadata Bridge | The MIMB OEM cumulative patch for Talend Metadata Bridge has been upgraded from MIMB-OEM-CumulativePatch-1010-20220208-1632.zip to MIMB-OEM-CumulativePatch-1010-20220622-1543.zip. |
All subscription-based Talend products with Talend Studio |
Improvements of FTP components: JAR upgrade, new JSch client options, and enhanced logging | JSch JAR is upgraded to version 0.2.1. A new option, Config client, is provided by FTP components, allowing you to set JSch client options. |
All subscription-based Talend products with Talend Studio |
Enhancement of tMongoDBOutput and tCosmosDBOutput to support dynamic schema |
The tMongoDBOutput and tCosmosDBOutput components support dynamic schema when inserting documents to collections. |
All subscription-based Talend products with Big Data |
Enhancement of the Snowflake metadata wizard | This feature enables the Snowflake metadata wizard to retrieve new schemas without affecting existing ones. |
All subscription-based Talend products with Talend Studio |
Data Quality: new features
Feature |
Description |
Available in |
---|---|---|
tDataQualityRules for Spark | The component is now available for Apache Spark Batch and Streaming. It
supports:
|
All Talend Platform and Data Fabric products |
Viewing rows of match analysis | The View rows feature is now available for match
analyses from the Duplicate Record Statistics or
Group Statistics tabs.
It lets you view all the analyzed data corresponding to the row you selected. You can export those data into a CSV file. |
All Talend Platform and Data Fabric products |
Continuous Integration: new features
Feature |
Description |
Available in |
---|---|---|
Talend CI Builder upgraded to version 8.0.7 |
Talend
CI Builder is upgraded from version 8.0.5 to version 8.0.7. Use Talend CI Builder 8.0.7 in your CI commands or pipeline scripts from this monthly version onwards until a new version of Talend CI Builder is released. |
All subscription-based Talend products with Talend Studio |
Support of displaying project details at build time |
Talend CI Builder now supports displaying project details at build time. |
All subscription-based Talend products with Talend Studio |
Compatibility check between Talend CommandLine and project versions at build time |
Talend CI Builder now supports checking the compatibility between Talend CommandLine and project versions at build time. If the Talend CommandLine version is older than your project version, the build fails and an error message is displayed. |
All subscription-based Talend products with Talend Studio |