This repository holds the projects we did in Hadoop & MapReduce (95-869) of CMU.
We hope the ideas in these projects can give some insights.
Notice that we took advantage of
- A stemmer class from the Internet in project 1;
- A parser class and an WholeFileInputFormat class from the sample codes in Hadoop: The Definitive Guide, in project 2 and project 4, respectively; and
- DynamicLMClassifier class from LingPipe in project 3.
Project 3 is shared under AGPL license due to the license under which LingPipe is published. Considered each project could work separately, the license of rest of the projects are undecided yet.