/Map-Juice

UIUC CS425

Primary LanguageJava

CS425 MP4

Gohar Irfan Chaudhry (gic2) and Yang Liu (liu310)

Instructions/Notes

Build the project from a clean state:

mvn clean install

local test/VM execution

run a helper script (on local machine)
./prepare.sh
run a helper script (on VM, only on master node)
./vm.sh
Hadoop Experiment
put testMedium.txt job1_testMedium.txt
put testMedium1.txt job1_testMedium1.txt
put testMedium2.txt job1_testMedium2.txt
put testMedium3.txt job1_testMedium3.txt
Hadoop Access on VM:
http://fa16-cs425-g10-01.cs.illinois.edu:50070/explorer.html
http://fa16-cs425-g10-01.cs.illinois.edu:8088/

Start Daemon:

As a leader:
mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-l -p <portNumber>"
E.g. mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-l -p 6667"

Start Daemon:

As a master and a leader:
mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-l -p <portNumber> -m"
E.g. mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-l -p 6667 -m"
As a common member and master:
mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-a <leaderHostName> -p <portNumber> -m"
E.g. mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-a fa16-cs425-g10-01.cs.illinois.edu:6667 -p 6680 -m"
As a common member:
mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-a <leaderHostName> -p <portNumber>"
E.g. mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Daemon" -Dexec.args="-a fa16-cs425-g10-01.cs.illinois.edu:6667 -p 6680"
put executable file and input file
put MJWordCount.class MJWordCount.class
put MJWordCountThread.class MJWordCountThread.class
put MJWordCountReducer.class MJWordCountReducer.class
put MJWordCountReducerThread.class MJWordCountReducerThread.class
put test.txt job1_test.txt
run Dummy (e.g for map/reduce)
mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Dummy" -Dexec.args="MJWordCount.class 4 job1_intermedia job1 mapJob"
mvn exec:java -Dexec.mainClass="edu.illinois.cs.cs425.Dummy" -Dexec.args="MJWordCountReducer.class 8 job1_intermedia job1_out reduceJob"

Commands supported:

Put
put <local filename> <sdfs filename>
E.g. put a.gz b.gz
Get
get <sdfs filename> <local filename>
E.g. get a.gz b.gz
Delete
delete <sdfs filename>
E.g. delete a.gz
List files

To list all files

ls

To list particular file replicas

ls <sdfs filename>
E.g. ls a.gz
Stored files on a node
store
SDFS Directory

All SDFS files will be stored in a directory called sdfsFile in each node.

(Note: Default portNumber is 6667 if not specified. Reminder: portNumber-1 & portNumber-2 are automatically ocuppied.)

(Attention! cannot implement more than one leaders in a group!)