Skip to main content

Governance Applications - Hybrid Deployment

Talend Data Stewardship and Talend Data Preparation can be deployed in a hybrid mode so that data is not required to flow to Talend Cloud. The diagram below shows Talend’s recommended deployment for these components:
  1. Ensure high availability.
  2. Allow one TDS / TDP environment to be accessed using the Talend Cloud application switcher.
  3. Use Talend Cloud for user management and security (mandatory).
  4. Leverage IaaS services for non-Talend components of the architecture - Shared file system, Kafka / ZooKeeper, MongoDB, and a Load Balancer.
Talend Cloud Talend Data Stewardship and Talend Data Preparation hybrid mode diagram.

Putting this configuration into the wider context, you can see a modified version of the Pre-Production / Production environment from the 'Data Fabric' section, with the addition of the hybrid TDS / TDP applications:

Talend Cloud Real-Time Big Data pre-production and production with TDS and TDP hybrid diagram.

For Talend Data Preparation, you also have the option of deploying components that allow Talend Data Preparation to access files on a Big Data Platform and also do the compute of the Preparation on this Big Data Platform. The separation of concerns between the two servers is optional, dependent on the your requirements / restrictions concerning your edge node(s). Note that both the Spark Job Server and the Streams Runner can only be installed on a Linux machine. For more information about running Preparations on a Big Data Cluster, see Talend Data Preparation architecture and Adding a dataset from HDFS.

Talend Cloud Data Preparation hybrid with Big Data diagram.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!