Hadoop Map-Reduce Design Patterns
Clone the repository:
git clone git@github.com:geftimov/MapReduce.git
Go into the folder:
cd MapReduce
Build it with Maven:
mvn clean install
An example run is included in the ReadMe of each individual pattern.
Summarization Patterns
1. Numerical Summarization ReadMe
- CommentWordCount
- MinMaxCount
- Average
- MedianStdDev (With In-Memory Map)
- MedianAndStandardDeviationCommentLengthByHour (Without In-Memory Map, more efficient)
2. Inverted Index Summarization ReadMe
3. Counting with Counters ReadMe
Filtering Patterns
1. Filtering ReadMe
2. Bloom Filtering ReadMe
3. Top Ten ReadMe
4. Distinct ReadMe
Data Organization Patterns
1. Structured to Hierarchical ReadMe
2. Partitioning ReadMe
3. Binning ReadMe
4. Total Order Sorting ReadMe
5. Shuffling ReadMe
Join Patterns
1. Reduce Side Join ReadMe
2. Replicated Join ReadMe
3. Composite Join ReadMe
4. Cartesian Product ReadMe
Georgi Kalinov Eftimov
jokatavr@gmail.com