In the Integration perspective
of the Studio, create an empty Map/Reduce Job from the
Job Designs node in the Repository tree view.
For further information about how to create a Map/Reduce Job, see
Talend Big Data Getting Started Guide
.
In the workspace, enter the name of the component to be used and select this component
from the list that appears. In this scenario, the components are tAggregateRow, tGlobalVarLoad, tMap,
tLogRow and two tHDFSInput (labelled customer in this scenario) components.
Connect one of the tHDFSInput components
to tAggregateRow using the Row > Main link and then do the same to link
tAggregateRow to tGlobalVarLoad.
This subJob is used to calculate the average salary and set this average into a reusable
variable.
Connect the same tHDFSInput component to
the other tHDFSInput component using the
Trigger > On Subjob Ok link.
Connect this second tHDFSInput component
to tMap using the Row
> Main link, then do the same to connect tMap to tLogRow
and in the popup dialog box, give this link a name you want to use.
This subJob is used to select the salaries above the average.
Did this page help you?
If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!