MapReduce programs

A collection of simple MapReduce programs that run on artificial data generated by data_generator.py.

  • Generate random data
 $ python data_generator.py 
  • Number Count MapReduce
 $ bin/hadoop jar mapreduce-1.0.4.jar com.arundhaj.mapreduce.NumberCount /input /output
  • Mark all prime numbers
 $ bin/hadoop jar mapreduce-1.0.4.jar com.arundhaj.mapreduce.PrimeNumber /input /output
  • Mark all even numbers
 $ bin/hadoop jar mapreduce-1.0.4.jar com.arundhaj.mapreduce.EvenOdd /input /output
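
To get a feel for what the three jobs compute without a Hadoop cluster, the map, shuffle, and reduce phases can be simulated in memory. The sketch below is not the code in this repository: the function names (`generate_numbers`, `run_mapreduce`, and the mappers and reducers) are hypothetical, and it assumes the input is a plain list of integers, that NumberCount counts occurrences of each number, and that the other two jobs label each number as prime or not and as even or odd.

```python
import random
from collections import defaultdict


def generate_numbers(count, low=1, high=100, seed=None):
    """Hypothetical stand-in for data_generator.py: random integers in [low, high]."""
    rng = random.Random(seed)
    return [rng.randint(low, high) for _ in range(count)]


def is_prime(n):
    """Trial division; adequate for small illustrative inputs."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def run_mapreduce(records, mapper, reducer):
    """Run the map, shuffle (group by key), and reduce phases in memory."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    return {key: reducer(key, values) for key, values in groups.items()}


def count_mapper(n):
    # Emit (number, 1) so the reducer can sum occurrences.
    yield n, 1


def sum_reducer(key, values):
    return sum(values)


def prime_mapper(n):
    # Emit (number, True/False) depending on primality.
    yield n, is_prime(n)


def even_odd_mapper(n):
    # Emit (number, "even") or (number, "odd").
    yield n, ("even" if n % 2 == 0 else "odd")


def first_reducer(key, values):
    # Every value for a given key is identical, so keep the first one.
    return values[0]


if __name__ == "__main__":
    sample = [2, 3, 4, 4, 9]
    print(run_mapreduce(sample, count_mapper, sum_reducer))
    print(run_mapreduce(sample, prime_mapper, first_reducer))
    print(run_mapreduce(sample, even_odd_mapper, first_reducer))
```

Passing `seed` to `generate_numbers` makes the random data reproducible, which helps when comparing a local run against the output of the Hadoop jobs.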