NLP
#NLP final project
##Requires
nltk
nltk corpus stopwords
##Run
###training
Program: cleanAndFormat.py
Input: train_data.txt
Outpot:
- outpu.txt - training data cleaned (no special charachters, stopwords....)
- matrix.txt - matrix of training data later on to be run against
Command:
python cleanAndFormat.py train_data.txt