My capstone project is about predicting air pollutants using eXtreme Gradient Boosting (XGBoost). Please see my proposal for a full explanation, this Readme describes how to install and run the code.
This project requires Python 2.7 and the following Python libraries installed:
You will also need to have software installed to run and execute an iPython Notebook
I recommend to install Anaconda, a pre-packaged Python distribution that contains all of the necessary libraries and software for this project.
The code is provided in different files:
airquality.ipynb
is the notebook fileutil.py
contains common functionality
In a terminal or command window, navigate to the project directory that contains this README and run the following command:
jupyter notebook airquality.ipynb
This will open the iPython Notebook software and project file in your browser.
The dataset used in this project is included in the data directory as TrainingData.csv
. You can find more information on this dataset on Kaggle page.