Skip to main content Skip to complementary content

Create the first Job

Follow these steps to create the first Job, which will set up an HCatalog database to manage the access log file to be analyzed.

Procedure

  1. In the Repository tree view, expand the Job Designs node, right-click Standard Jobs and select Create folder to create a new folder to group the Jobs that you will create.
  2. In the Repository tree view, right-click Job Designs and select Create folder to create a new folder to group the Jobs that you will create.
  3. Right-click the folder you just created, and select Create job to create your first Job. Name it A_HCatalog_Create to identify its role and execution order among the example Jobs.

    You can also provide a short description for your Job, which will appear as a tooltip when you move your mouse over the Job.

  4. Drop a tHDFSDelete and two tHCatalogOperation components from the Palette onto the design workspace.
  5. Connect the three components using Trigger > On Subjob Ok connections.

    The HDFS subjob will be used to remove any previous results of this demo example, if any, to prevent possible errors in Job execution, and the two HCatalog subjobs will be used to create an HCatalog database and set up an HCatalog table and partition in the created HCatalog table, respectively.

  6. Label these components to better identify their functionality.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!