Naive bayes simplifies the calculation of probabilities by assuming that the probability of each attribute belonging to a given class value is independent of all other attributes. This is a strong assumption but results in a fast and effective method.
- Python3.2 or higher (https://www.python.org/downloads/)
- mail-parser (https://pypi.python.org/pypi/mail-parser)
pip install mail-parser
pip install numpy
pip install pandas
pip install scikit-learn
http://scikit-learn.org/stable/tutorial/text_analytics/working_with_text_data.html for scikit-learn text classification