Double-click the tPigLoad component labeled event to open its Component view.
Click the [...] button next to Edit schema to open the schema editor.
Click the [+] button three times to add three rows and, in the Column column, rename them date, street and event, respectively.
Click OK to validate these changes.
In the Mode area, select Map/Reduce.
Because you have already configured the connection to the Hadoop distribution in the first tPigLoad component, traffic, this event component reuses that connection, so the corresponding options in the Distribution and Version lists are selected automatically.
In the Load function field, select the PigStorage function to read the source data.
In the Input file URI field, enter the directory where the event data is stored. As explained earlier, the directory in this example is "/user/ychen/tpigmap/date&event".
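To make the effect of the schema and the PigStorage load function more concrete, the following Python sketch roughly mimics how each input line could be split into the date, street and event columns. It is an illustration only, not what Talend or Pig runs internally. It assumes the field delimiter is a tab, which is the PigStorage default; this section does not state the delimiter actually used. The load_rows function and the sample line are hypothetical.

```python
from typing import Dict, List, Optional

# Column names defined in the schema editor for the event component.
SCHEMA = ["date", "street", "event"]


def load_rows(lines: List[str], delimiter: str = "\t") -> List[Dict[str, Optional[str]]]:
    """Split each line on the delimiter and map the values to the schema columns.

    Assumption: tab is the delimiter (the PigStorage default). In this sketch,
    missing trailing fields become None and extra fields are ignored.
    """
    rows = []
    for line in lines:
        values: List[Optional[str]] = list(line.rstrip("\n").split(delimiter))
        # Pad short records so every schema column gets a value.
        values += [None] * (len(SCHEMA) - len(values))
        # zip stops at the end of SCHEMA, so surplus fields are dropped.
        rows.append(dict(zip(SCHEMA, values)))
    return rows


# Made-up sample line; the real contents of the event data are not shown here.
sample = ["2013-01-01\tMain Street\troad works\n"]
rows = load_rows(sample)
```

With the made-up line above, rows holds a single record whose date, street and event keys carry the three tab-separated values.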