This is the source code for my attempts at implementing the algorithms found in Data-Intensive Text Processing with MapReduce. Eventually I would like to extend these implementations to other Big Data technologies such as Storm, Scalding, and Spark.