This repository contains many programs from Big Data field using mapreduce written in Java, with as simple code as possible.
An input file which should be present in HDFS. Jar file of each program to run that respective program, create that by exporting your project as an jar file.
hdfs dfs -put <[filename]>
- hadoop jar .jar <[Driver Class Name]> <[Input File Name]> <[Output Directory]> (Make sure that the output directory do not exists previosuly, otherwise command will fail.)
- yarn jar <[jar file name]>.jar <[Driver Class Name]> <[Input File Name]> <[Output Directory>
hdfs dfs -cat <[output directory name]>/p*