Text_Classifier for classifying free online courses using scikit-learn and nltk
- /data folder contains training courses data
- make_data_set_pickle.py converts training data from different course files in /data to single pickle file to be used by Text_classifier
- Text_classifier.py performs classification of courses into their categories (such as Math, Physics, Economics etc) using Support Vector Machines in scikit-learn.
- text_processor.py is used for common text processing tasks by text_classifier.