/pandas-cleanup

Data cleaning exercise from Data + Desserts talk at the Flatiron School.

Primary LanguageJupyter NotebookMIT LicenseMIT

Data cleaning exercise from Data + Desserts talk at the Flatiron School

Demo exercise for cleaning messy data with Pandas using sales data from a fictious ice cream company. You can view my slides from the presentation here.

sales.csv is the dataset that I generated, datagen.py is the script used to randomly generate the sales dataset, and explore.ipynb is the Jupyter Notebook used to walk through the cleaning and analysis.