Email-spam-filter-naive-bayes-classifier using scikit-learn text classification

Naive bayes simplifies the calculation of probabilities by assuming that the probability of each attribute belonging to a given class value is independent of all other attributes. This is a strong assumption but results in a fast and effective method.

Prerequisites

  1. Python3.2 or higher (https://www.python.org/downloads/)
  2. mail-parser (https://pypi.python.org/pypi/mail-parser)
pip install mail-parser
pip install numpy
pip install pandas
pip install scikit-learn

Link

http://scikit-learn.org/stable/tutorial/text_analytics/working_with_text_data.html for scikit-learn text classification