You can create a Hadoop cluster metadata definition by importing your Hadoop cluster configuration, so that you can
quickly configure components with the cluster's connection information. Talend Studio also
allows you to create a cluster metadata definition from scratch.
Before you begin
-
This tutorial requires a Hadoop cluster. Make sure
you have access to one before you start.
-
Select the Integration perspective.
Procedure
-
In the Repository, expand Metadata, right-click
Hadoop Cluster and click
Create Hadoop Cluster.
-
In the Name field,
enter a name.
Example
MyHadoopCluster_files
- Optional:
In the Purpose
field, enter a purpose.
Example
Cluster connection metadata
- Optional:
In the
Description field, enter a description.
Example
Metadata to connect to a Cloudera CDH
cluster
Tip: Enter a Purpose
and Description to stay organized.
-
Click Next.
-
Select a Distribution.
Example
Select
Cloudera.
-
Select a Version.
Example
Select
Cloudera CDH6.1.1 [Built in].
-
Select
Import configuration from local files.
-
Click Next.
-
Under Location, select the file of your choice in the File Explorer.
-
Select your modules.
Example
Select
HDFS or YARN.
-
Click Finish.
Example
The Hadoop Cluster Connection window opens
with your connection details already
filled in.
- Optional:
Click
Check Services.
-
Click Finish.
Results
The Hadoop cluster metadata definition appears in the
Repository.
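To get a feel for what the imported configuration contains, the following minimal Python sketch reads property values from Hadoop XML site files. It assumes the local files you select are standard Hadoop site files such as core-site.xml and yarn-site.xml. It does not reproduce what Talend Studio does internally. The helper name read_hadoop_properties, the sample host names, and the ports are hypothetical placeholders.

```python
import xml.etree.ElementTree as ET


def read_hadoop_properties(xml_text):
    """Return a dict of property name -> value from a Hadoop *-site.xml document."""
    root = ET.fromstring(xml_text)
    props = {}
    # Each setting is stored as <property><name>...</name><value>...</value></property>.
    for prop in root.iter("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return props


# Hypothetical sample content; host names and ports are placeholders.
CORE_SITE = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://namenode.example.com:8020</value>
  </property>
</configuration>
"""

YARN_SITE = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>yarn.resourcemanager.address</name>
    <value>resourcemanager.example.com:8032</value>
  </property>
</configuration>
"""

core = read_hadoop_properties(CORE_SITE)
yarn = read_hadoop_properties(YARN_SITE)

# These two settings correspond to the HDFS and YARN endpoints of a cluster.
print("NameNode URI:", core["fs.defaultFS"])
print("Resource Manager:", yarn["yarn.resourcemanager.address"])
```

Running the same function against your own site files before importing them can help you confirm that they point to the cluster you expect.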