Double-click the tHDFSDelete
component, which is labelled HDFS_ClearResults in this example, to open its Basic settings view on the Component tab.
Click the Property Type list box and select Repository, and then click the [...] button to open the Repository Content dialog box to use a centralized HDFS connection.
Select the HDFS connection defined for connecting to the HDFS system and click
OK.
All the connection details are automatically filled in the respective
fields.
In the File or Directory Path field,
specify the directory where the access log file will be stored on the
HDFS, /user/hdp/weblog in this
example.
Double-click the first tHCatalogOperation component, which is labelled
HCatalog_Create_DB in this
example, to open its Basic settings
view on the Component tab.
Click the Property Type list box and select Repository, and then click the [...] button to open the Repository Content dialog box to use a centralized HCatalog connection.
Select the
HCatalog connection defined for connecting to the HCatalog database and
click OK. All the connection details
are automatically filled in the respective fields.
From the Operation on list, select
Database; from the Operation list, select Drop if exist and create.
In the Option list of the Drop configuration area, select Cascade.
In the Database location field, enter
the location for the database file is to be created in HDFS, /user/hdp/weblog/weblogdb in this
example.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!