Configuring and running your Spark Job with CDP Public Cloud Data Hub on AWS
Talend Studio allows you to deploy and execute your Spark Streaming and Spark Batch Jobs on a remote JobServer with a CDP Public Cloud Data Hub on AWS instance.
Before you begin
- The JobServer settings are defined correctly in Talend Studio to run your Job remotely. For more information see, Configuring remote execution (Talend > Run/Debug).
- The AWS instance environment is defined in Cloudera Management Console. For more information, see Register an AWS environment from the official Cloudera documentation.
- The cluster on AWS is defined in the Cloudera Management Console. For more information, see Create a custom cluster on AWS from the official Cloudera documentation.