/CIJ_2018

Class materials for pandas workshops one and two

Primary LanguageJupyter Notebook

Cleaning and wrangling data with Pandas and Jupyter Notebook at CIJ summer School 2018

In this series of classes we're going to get you started with python so you can analyse your own data at home.

Our Data

We are going to be digging deep into political donations data from the electoral commission. We are looking at all donations between 2015 to date. Like all public data it's messy so there will be plenty of cleaning to do!

You can go back further by searching the database here

Class 1

In the first class we're going to learn how to read in a csv file, create a dataframe, identify different ways it may be dirty and learn some techniques for cleaning up our data set.

The raw data is called results and should be in a file on your desktop

We will be using the Data cleaning with Pandas-workbook to start with - a complete version is in the repo in case you get into trouble

Class 2

In the later class we're going to be reading in our new clean data file, interrogating our data and visualising our results

Contact me

If you have any questions you can email me at karrie.anne.kehoe@gmail.com or DM me on Twitter at @karriekehoe

Happy wrangling!