What's new in R2020-09
Big Data: new features
Feature |
Description |
Available in |
---|---|---|
Support of Databricks 6.4 | Databricks 6.4 is now supported on AWS (as technical preview) and on Azure. |
All Talend products with Big Data |
Using SQL queries with tDeltaLakeInput | The Query option is now available. It allows you to use an SQL query to retrieve the data. |
All Talend products with Big Data |
tS3Configuration usage rule change | You can now add several tS3Configuration components to Spark Streaming or Spark Batch Jobs. It allows you to use multiple S3 buckets in your Jobs, and therefore move data from one bucket to another. |
All Talend products with Big Data |
In-transit encryption provided for tS3Configuration | When you use the SSE-KMS encryption in the tS3Configuration component, you
can now choose whether data is encrypted in-transit or not. Information noteNote: The in-transit
encryption option is selected by default.
|
All Talend products with Big Data |
Using generic and specific arguments with Sqoop components | You can use generic and specific arguments with the
following Sqoop components:
|
All Talend products with Big Data |
Authenticating with service account on Google Dataproc distribution | The tHiveInput component provides the Authentication type property which allows you to authenticate to your project using either the service account or the Access Key and Secret Key. |
All Talend products with Big Data |
New default filesystem for tS3Configuration | S3N is now deprecated. For this reason, the S3A filesystem becomes the filesystem used by default by tS3Configuration. |
All Talend products with Big Data |
Data Integration: new features
Feature |
Description |
Available in |
---|---|---|
Git conflict resolution enhancement |
Talend Studio now supports comparing the conflicted items for the
following types of metadata connection in EMF compare editor:
|
All Talend products with Talend Studio |
Logging of performances enhancement | Talend Studio login performance data can now be collected and exported via the menu item. |
All Talend products with Talend Studio |
SAP data extraction performance improved |
Streaming mode is provided for SAP data extraction, which supports multiple threads and improves data extraction performance. This is implemented by replacing the SAP transport TEDK910058.zip with TEDK910105.zip. The streaming mode provides three new parameters for configuring streaming mode: #feature.streaming.threadCount, #feature.streaming.topic.partitionCount, and #feature.streaming.topic.replicationFactor; and the following three options are added to the tSAPTableInput and the tELTSAPMap componrnts: Number of threads to extract the data from SAP, Kafka topic partition count, and Max time to wait for more data. To extract data in the streaming mode, you need also to upgrade your RFC server using the patch provided here. |
All Talend products with Talend Studio |
tAmazonRedshiftManage enhanced |
The tAmazonRedshiftManage can now pause and resume a Redshift cluster. |
All Talend products with Talend Studio |
tTeradataTPTExec and tTeradataTPTUtility enhanced |
The two components provide the Enforce database name on Log, Work, and Error tables option, allowing you to specify a database other than the default for the work table, the error table, and the log table. |
All Talend products with Talend Studio |
tPetals components deprecated |
Two components, tPetalsInput and tPetalsOutput, are deprecated. |
All Talend products with Talend Studio |
Schema keys used for Upsert operations |
In addition to schema columns, schema keys can also be used for Upsert operations. Components involved:
|
All Talend products with Talend Studio |
New components available: Delta Lake components and RabbitMQ components |
The following components are new for the Standard Job framework in this release.
|
All Talend products with Talend Studio |
Delta Lake connection metadata wizard |
The wizard guides you through the steps of creating Delta Lake connection metadata in Talend Studio. |
All Talend products with Talend Studio |
tDatasetInput and tDatasetOutput: configuration simplified |
tDatasetInput and tDatasetOutput: the Platform drop-down list in the Advanced settings view was dropped. |
All Talend products with Talend Studio |
Neo4J version 4.x components in Studio |
tNeo4Jv4Input and tNeo4jv4Output are now available in Talend Studio for Standard Jobs. |
All Talend products with Talend Studio |
Data Mapper: new features
Feature |
Description |
Available in |
---|---|---|
New functions | Two new functions have been added to Talend Data Mapper:
|
All Talend Platform and Data Fabric products |
Java Reflection preference for tHMap | A new preference in Components Preferences allows you to disable the use of Java Reflection in the code generated by the tHMap component. |
All Talend Platform and Data Fabric products |