/PFP

Primary LanguagePython

time /Users/yinli/software/spark-2.0.1-bin-hadoop2.7/bin/spark-submit FPF.py

this command is used to submit and run spark program
CSV->txt, CSV data is easy to be opened with excel, we can delete cols or rows. Then we save it as txt with TAB,
When dealing with txt files, split by '\t'