I have used various Machine Learning algorithms over the Enron Dataset and analzed their performance after parameter tuning .
This project requires Python 2.7 and the following Python libraries installed:
You will also need to have software installed to run and execute an iPython Notebook
We recommend students install Anaconda, a pre-packaged Python distribution that contains all of the necessary libraries and software for this project.
In a terminal or command window, navigate to the top-level project directory finding_donors/
(that contains this README) and run one of the following commands:
ipython notebook Enron Email POI classification.ipynb
or
jupyter notebook Enron Email POI classification.ipynb
- Add XGboost
- Perform Recursive feature Selection
- Add Graph for feature analysis
- Add more ensemble methods