/data_wrangling_exercises

Primary LanguageJupyter NotebookGNU Affero General Public License v3.0AGPL-3.0

Practical Python: Data Wrangling and Data Quality

This repo contains draft coding exercises for the early-release version of the book Practical Python: Data Wrangling and Data Quality to be published by O'Reilly Media in 2021.

Before You Begin

Below you will find an overview of this repo's contents, as well as important tips and information on how to use these files. In general, all exercises are accessible as standalone .py files, and as Jupyter Notebooks. The notebooks can either be downloaded to your device and run locally, or opened and run in Google Colab (https://colab.research.google.com/). The draft text of Chapter 1 includes basic instructions on how to get started with some of these tools; this text will be updated/completed before final publication.

Working with data files

Because data sets can often be quite large, the data sets for these exercises are available for download here.

If you are working locally

Data sets should be downloaded/copied in the same folder as the Python file or notebook, unless otherwise indicated.