Pandas is the library one should master for data wrangling and analysis, and to prepare data for machine learning pipeline. This repository is a jump start to Pandas and its most useful functionalities.
This repository is build for the INFORMS Student Chapter at VT Python Workshop.
The dataset can be downloaded from the followig Kaggle link: https://www.kaggle.com/dansbecker/melbourne-housing-snapshot
A python installation with conda. Anaconda recommended.
After Python installation, simply run jupyter notebook
and open the notebook in this repository.