Counting words using custom map and reduce code (deprecated)
This scenario applies only to Talend products with Big Data.
For more technologies supported by Talend, see Talend components.
Based on the WordCount MapReduce example in the Apache documentation at http://wiki.apache.org/hadoop/WordCount, this scenario demonstrates how to use tJavaMR to create a MapReduce program that counts the occurrences of each word in its input.
The sample data to be used in this scenario reads as follows:
Hello world goodbye world
Hello hadoop bye Hadoop
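As a rough illustration of what the map and reduce code is expected to do with this data, the following is a minimal sketch in plain Java, without any Hadoop or Talend dependency. The class name WordCountSketch and its methods are hypothetical and are not part of tJavaMR. The sketch also assumes that words are split on whitespace and compared case-sensitively, as in the Apache WordCount example, so hadoop and Hadoop are counted separately.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;
import java.util.TreeMap;

// Hypothetical, Hadoop-free illustration of the word count logic.
public class WordCountSketch {

    // Map step: emit a (word, 1) pair for every whitespace-separated token.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        StringTokenizer tokens = new StringTokenizer(line);
        while (tokens.hasMoreTokens()) {
            pairs.add(new SimpleEntry<>(tokens.nextToken(), 1));
        }
        return pairs;
    }

    // Reduce step: sum the emitted counts for each distinct word.
    // Assumption: keys are case-sensitive; a TreeMap keeps them sorted.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> totals = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return totals;
    }

    // Run the map step on every line, then reduce all emitted pairs together.
    static Map<String, Integer> count(List<String> lines) {
        List<Map.Entry<String, Integer>> allPairs = new ArrayList<>();
        for (String line : lines) {
            allPairs.addAll(map(line));
        }
        return reduce(allPairs);
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList(
                "Hello world goodbye world",
                "Hello hadoop bye Hadoop");
        // Print one "word<TAB>count" line per distinct word.
        count(lines).forEach((word, total) -> System.out.println(word + "\t" + total));
    }
}
```

Under these assumptions, the sample data yields a count of 2 for Hello and world, and 1 for each of goodbye, bye, hadoop, and Hadoop. In a real Job, the framework groups the mapper output by key before the reduce step, which this sketch imitates by merging all pairs in a single map.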
Before replicating this scenario, ensure that you have the appropriate rights and permissions to access the Hadoop distribution to be used. Then proceed as follows: