In the Integration perspective
of Talend Studio,
create an empty Job, named pigweather for
example, from the Job Designs node in the
Repository tree view.
For further information about how to create a Job, see the Talend StudioUser Guide.
Drop two tPigLoad components, tPigMap and two tPigStoreResult onto the workspace.
The components can be labelled if needs be. In this scenario, we label the
two tPigLoad components as traffic and event, respectively, which load accordingly the traffic data
and the related event data. Then we label the two tPigStoreResult components as normal and jam,
respectively, which write accordingly the results to the Hadoop distribution
to be used. For further information about how to label a component, see the
Talend StudioUser Guide.
Right-click the tPigLoad component
labeled traffic to connect it to
tPigMap using the Row > Pig combine link from the contextual menu.
Repeat this operation to link the tPigLoad component labeled event to tPigMap, too. As
this is the second link created, it becomes automatically the lookup
link.
Use the Row > Pig combine link again to
connect tPigMap to each of the two
tPigStoreResult components.
You need to name these links in the dialog box popped up once you select
the link type from the contextual menu. In this scenario, we name the link
to tPigStoreResult labeled normal as out and the link to tPigStoreResult labeled jam as reject.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!