/learning-hadoop

Set of examples to learn/refresh hadoop concepts

Primary LanguagePythonMIT LicenseMIT

TODO

[ ] Move readme to udacity folder. This readme should be generic [ ] Code review the examples and

Useful

Links to have in mind:

Datasets

Download and unzip data sets from:

Running a map reduce job with Hadoop Streaming:

hadoop jar /usr/local/Cellar/hadoop/2.8.2/libexec/share/hadoop/tools/lib/hadoop-streaming-2.8.2.jar -mapper ./ex7-mapper-invertedIndex.py --reducer ./ex7-reducer-invertedIndex.py -file ./ex7-mapper-invertedIndex.py -file ./ex7-reducer-invertedIndex.py -input nodes -output output1