Creating a Hadoop cluster for machine learning

This section explains how to create a Hadoop cluster for developing a machine learning routine.

Procedure

  1. Expand Metadata.
  2. Right-click Hadoop Cluster and create a new cluster.
  3. Enter a name for the cluster, MarketingCampaignData in this example.
  4. Specify a Linux OS user on the cluster.

    In this example, the user puccini has already been created on the cluster.

    The training and test datasets used in this article have been slightly modified from the original source and preloaded into HDFS. You can download them from the Downloads panel.

  5. Configure the HDFS connection as follows.
    • Row Separator: Standard EOL, "\n".
    • Field Separator: Comma, ",".
    • Header: Select the Header check box, select 1 from the drop-down list, and then select the Set heading row as column names check box.
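
To illustrate what these HDFS connection settings mean for the data, the following minimal Python sketch parses a small text sample using the same row separator, field separator, and one-row header. It is not part of the Studio procedure. The function name parse_delimited and the sample column names and values are hypothetical and are not taken from the actual datasets.

```python
import csv

# Settings mirroring the HDFS connection described above.
ROW_SEPARATOR = "\n"    # Standard EOL
FIELD_SEPARATOR = ","   # Comma
HEADER_ROWS = 1         # Number of heading rows; the last one supplies column names


def parse_delimited(text, field_sep=FIELD_SEPARATOR,
                    row_sep=ROW_SEPARATOR, header_rows=HEADER_ROWS):
    """Split text into rows and fields, using the heading row as column names.

    Hypothetical helper for illustration only.
    """
    if header_rows < 1:
        raise ValueError("at least one heading row is required for column names")
    # Split on the row separator and drop empty trailing lines.
    lines = [line for line in text.split(row_sep) if line]
    rows = list(csv.reader(lines, delimiter=field_sep))
    column_names = rows[header_rows - 1]
    # Every row after the heading rows becomes a record keyed by column name.
    records = [dict(zip(column_names, row)) for row in rows[header_rows:]]
    return column_names, records


# Hypothetical sample; the real datasets have their own columns.
sample = "age,job,conversion\n35,technician,yes\n52,manager,no\n"
columns, records = parse_delimited(sample)
print(columns)
print(records[0])
```

With the heading row used as column names, each data row can be addressed by field name rather than by position, which is the effect of selecting the Set heading row as column names check box.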
