/co-training

Implementation of the Co-Training paper

Primary LanguageJupyter NotebookGNU General Public License v3.0GPL-3.0

Implements the Co-training Paper ( A Blum, T Mitchell, COLT-98 ) in a single ipython-notebook file.

Uses the original dataset:
http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-51/www/co-training/data/

The code is inspired from the following sources:
http://stackoverflow.com/questions/10098533/implementing-bag-of-words-naive-bayes-classifier-in-nltk
http://radimrehurek.com/data_science_python/
http://stackoverflow.com/questions/15507172/how-to-get-bag-of-words-from-textual-data

Python-dependencies:
Scikit-learn, Python-nltk, BeautifulSoup and TextBlob libraries.