Classifying text with bag-of-words, using data from a Kaggle competition: Bag of Words Meets Bags of Popcorn. Improved version of the original Kaggle tutorial.
bow_predict.py - train and predict, save a submission file
bow_validate.py - create train/test split, train, get validation score
bow_validate_tfidf.py - an improved validation script, with TF-IDF and n-grams
KaggleWord2VecUtility.py - il scripto originale di Kaggle tutoriale
See http://fastml.com/classifying-text-with-bag-of-words-a-tutorial/ for description.