Talend Studio 特性

已弃用：本文档最后一次更新是在 2024 年 3 月，其内容可能不是最新的。有关特性可用性的最新信息，请参阅数据集成和质量定价以及产品条款。

云 Talend Studio 特性

云许可证的设计和生产力特性

特性	适用产品...
转换数据，包括筛选、平化/标准化、聚合、复制、查找、连结、时间窗口化	Talend Cloud Pipeline Designer 标准 Talend Cloud Data Integration Talend Cloud Big Data Talend Cloud Data Management Platform Talend Cloud Big Data Platform Talend Cloud Data Fabric
样例数据实时预览
使用相同的连接器和图形组件在同一界面中设计批处理和流式处理管道
管道中数据标准化、清理和丰富功能
读时架构支持
轻松嵌入 Python 代码
支持各种数据格式，包括: AVRO、JSON、Parquet、Excel、CSV
利用 Talend Trust Score™ 快速评估数据集的质量
将数据存储在所有 Talend 产品的共享通用数据集存储库中
向 Talend Data Stewardship 活动写入数据
文件管理: 打开、移动、压缩、解压缩，无需编写脚本。参见相关组件文档。	Talend Cloud Data Integration Talend Cloud Big Data Talend Cloud Data Management Platform Talend Cloud Big Data Platform Talend Cloud API Services Platform Talend Cloud Data Fabric
图形化设计环境。参见设计作业和路由。
通过主作业进行数据流以及与数据集成的控制和协调。参见 Orchestration (Integration) components。
映射、聚合、排序、丰富、合并数据。参见 Mapping data flows 及 Processing (Integration) components。
通过共享存储库实现团队协作。参见 Working collaboratively on Git projects in Talend Cloud。
持续集成
审核、作业比较、影响分析、测试、调试和调整。参见 Talend Project Audit、Comparing Jobs、 Impact analysis、Testing Jobs and Services using test cases 及 Running a Job in Java Debug mode。
Metadata bridge，用于元数据导入/导出和元数据集中管理。参见 Getting Started with the Talend Metadata Bridge。
远程运行和并行化处理。参见 Running a Job remotely 及 Using parallelization to optimize Job performance。
动态表结构、可重用小作业及引用工程。参见 Dynamic schema、Designing a Joblet 及 Defining project references。
向导和交互式数据查看器。参见 Managing metadata in Talend Studio。
版本控制。参见 Managing Job and Route versions。
导出并在运行时环境中执行独立作业
变化数据捕获 (CDC)。参见 Change Data Capture (CDC)。
自动生成文档。参见 Documenting a Job or a Route。
发布到 Talend Cloud。参见 Publishing to Talend Cloud 。
受控的补丁管理。参见 Updating Talend Studio。
支持 Hadoop 技术。参见 HBase、HCatalog、HDFS 及 Hive。	Talend Cloud Big Data Talend Cloud Big Data Platform Talend Cloud Data Fabric
支持 Apache Spark Batch 及 Apache Spark Streaming。参见 How a Talend Job for Apache Spark works。
支持 Spark Universal 环境。参见 Spark Universal support for Hadoop distributions in Talend Studio 。
支持 YARN 平台上运行 Spark。参见 AWS EMR、Azure HDInsight、Cloudera 及 Google Dataproc。
支持无服务器平台。参见 Databricks、Delta Lake 及 Azure Synapse Analytics with Spark pools。
支持动态发行版。参见 Dynamic support for Hadoop distributions in Talend Studio 。
支持 Apache Spark Streaming。参见 How a Talend Job for Apache Spark works。	Talend Cloud Data Fabric
基于 WS 策略的Web 服务安全性	Talend Cloud API Services Platform Talend Cloud Data Fabric
基于 ActiveMQ 的可靠消息传递主干
Service Locator and Service Registry
XML 密钥管理规范 (XKMS)
命令行和脚本工具
作为 OSGi 功能进行构建和部署
构建、部署和管理微服务
根据企业集成模式 (EIP) 进行消息及事件的传递和路由
拖放式路由和 SOAP/REST 服务的创建和模拟
持续交付	Talend Cloud Data Management Platform Talend Cloud Big Data Platform Talend Cloud API Services Platform Talend Cloud Data Fabric
分层格式的图形化映射。参见 What is Talend Data Mapper？。
存储库管理器

适用云许可证的数据质量特性

特性	适用产品...
Spark 和 Hadoop 上的数据清理、数据屏蔽和数据匹配。参见数据质量组件。	Talend Cloud API Services Platform Talend Cloud Big Data Platform Talend Cloud Data Fabric Talend Cloud Data Management Platform
剖析大数据。参见 Profiling Big Data。
分析数据库/分隔型文件: 冗余、表、列、关联分析。请参阅创建分析。
分析后生成报告。参见报告。
在数据质量数据集市中存储 Talend Studio 中执行的分析和报告。参见数据质量数据集市。
通过图形化图标和下钻数据进行数据剖析和分析。请参阅创建分析。
通过自动检测模式进行语义发现。参见 Exploring semantic categories of data columns。
用于数据匹配和重复数据删除的机器学习。参见机器学习组件。
通过屏蔽和加密功能实现数据隐私。参见数据隐私组件。
数据匹配。参见使用 Talend 工具进行数据匹配。
标准化数据。参见数据质量组件。
全面生存。参见 tRuleSurvivorship 组件。
模式库。参见 Patterns。
使用本福德定律检测欺诈模式参见 Fraud detection。
通过指标阈值进行高级统计数据。参见 Indicators。

本地 Talend Studio 特性

本地部署许可证的设计和生产力特性

特性	适用产品...
文件管理: 打开、移动、压缩、解压缩，无需编写脚本。参见相关组件文档。	Talend Data Integration Talend Big Data Talend ESB Talend Data Management Platform Talend Big Data Platform Talend Data Services Platform Talend Real-Time Big Data Platform Talend MDM Platform Talend Data Fabric
映射、聚合、排序、丰富、合并数据。参见 Mapping data flows 及 Processing (Integration) components。	Talend Data Integration Talend Big Data Talend ESB Talend Data Management Platform Talend Big Data Platform Talend Data Services Platform Talend Real-Time Big Data Platform Talend MDM Platform Talend Data Fabric
Metadata bridge，用于元数据导入/导出和元数据集中管理。参见 Getting Started with the Talend Metadata Bridge。
图形化设计环境。参见设计作业和路由。	Talend Data Integration Talend Big Data Talend ESB Talend Data Management Platform Talend Big Data Platform Talend Data Services Platform Talend Real-Time Big Data Platform Talend MDM Platform Talend Data Fabric
通过共享存储库实现团队协作。参见 Working collaboratively on project items。
通过主作业进行数据流以及与数据集成的控制和协调。参见 Orchestration (Integration) components。
持续集成
审核、作业比较、影响分析、测试、调试和调整。参见 Overview of Talend Project Audit、Comparing Jobs、Impact analysis、Testing Jobs and Services using test cases 及 Running a Job in Java Debug mode。
远程运行和并行化处理。参见 Running a Job remotely 及 Using parallelization to optimize Job performance。
动态表结构、可重用小作业及引用工程。参见 Dynamic schema、Designing a Joblet 及 Defining project references。
向导和交互式数据查看器。参见 Managing metadata in Talend Studio。
版本控制。参见 Managing Job and Route versions。
导出并在运行时环境中执行独立作业
变化数据捕获 (CDC)。参见 Change Data Capture (CDC)。
自动生成文档。参见 Documenting a Job or a Route。
受控的补丁管理。参见 Updating Talend Studio。
支持 Hadoop 技术。参见 HBase、HCatalog、HDFS 及 Hive。	Talend Big Data Talend Big Data Platform Talend Real-Time Big Data Platform Talend Data Fabric
支持 Apache Spark Streaming。参见 How a Talend Job for Apache Spark works。	Talend Real-Time Big Data Platform Talend Data Fabric
支持 Apache Spark Batch。参见 How a Talend Job for Apache Spark works。	Talend Big Data Talend Big Data Platform Talend Real-Time Big Data Platform Talend Data Fabric
支持 Spark Universal 环境。参见 Spark Universal support for Hadoop distributions in Talend Studio 。
支持 YARN 平台上运行 Spark。参见 AWS EMR、Azure HDInsight、Cloudera 及 Google Dataproc。
支持无服务器平台。参见 Databricks、Delta Lake 及 Azure Synapse Analytics with Spark pools。
支持动态发行版。参见 Dynamic support for Hadoop distributions in Talend Studio 。
拖放式路由和 SOAP/REST 服务的创建和模拟	Talend ESB Talend Data Services Platform Talend Real-Time Big Data Platform Talend MDM Platform Talend Data Fabric
基于 WS 策略的Web 服务安全性
根据企业集成模式 (EIP) 进行消息及事件的传递和路由
基于 ActiveMQ 的可靠消息传递主干
Service Locator and Service Registry
命令行和脚本工具
XML 密钥管理规范 (XKMS)
作为 OSGi 功能进行构建和部署
持续交付	Talend Data Management Platform Talend Big Data Platform Talend Data Services Platform Talend Real-Time Big Data Platform Talend MDM Platform Talend Data Fabric
分层格式的图形化映射。参见 What is Talend Data Mapper？。
存储库管理器
构建微服务	Talend Data Services Platform Talend Real-Time Big Data Platform Talend MDM Platform Talend Data Fabric

适用本地部署产品许可证的数据质量特性

特性	适用产品...
分析数据库/分隔型文件: 冗余、表、列、关联分析。请参阅创建分析。	Talend Big Data Platform Talend Data Management Platform Talend Data Fabric Talend MDM Platform Talend Data Services Platform Talend Real-Time Big Data Platform
通过图形化图标和下钻数据进行数据剖析和分析。请参阅创建分析。
模式库。参见 Patterns。
使用本福德定律检测欺诈模式参见 Fraud detection。
通过指标阈值进行高级统计数据。参见 Indicators。
通过自动检测模式进行语义发现。参见 Exploring semantic categories of data columns。	Talend Big Data Platform Talend Data Management Platform Talend Data Fabric Talend MDM Platform Talend Data Services Platform Talend Real-Time Big Data Platform
Spark 和 Hadoop 上的数据清理、数据屏蔽和数据匹配。参见数据质量组件。
剖析大数据。参见 Profiling Big Data。
分析后生成报告。参见报告。
在数据质量数据集市中存储 Talend Studio 中执行的分析和报告。参见数据质量数据集市。
用于数据匹配和重复数据删除的机器学习。参见机器学习组件。
通过屏蔽和加密功能实现数据隐私。参见数据隐私组件。
数据匹配。参见使用 Talend 工具进行数据匹配。
标准化数据。参见数据质量组件。
全面生存。参见 tRuleSurvivorship 组件。

本页面有帮助吗？

如果您发现此页面或其内容有任何问题 – 打字错误、遗漏步骤或技术错误 – 请告诉我们！

在此处留下您的反馈