/DataWranglingCityOfLondonOSM

Final project for Data Wrangling (Udacity.com Data Analyst Nanodegree)

Primary Language: Jupyter Notebook

A case study of data wrangling using OpenStreetMap data for the City of London

Abstract

This project explores the City of London, United Kingdom, using the information available in OpenStreetMap. Its purpose is to apply data wrangling techniques in Python to load, audit, clean, and export an OpenStreetMap (OSM) file so that the data can be analysed further with SQLite and Python data analysis libraries. The City of London geographical area serves as the case study.
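As a rough illustration of the audit step, the sketch below streams an OSM XML document and counts how often each tag key occurs. It is not the code from the notebooks: the function name `count_tag_keys` and the tiny inline `SAMPLE_OSM` document are made up for this example, and only the Python standard library is assumed.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical, hand-written OSM fragment used only for illustration.
SAMPLE_OSM = """<?xml version="1.0" encoding="UTF-8"?>
<osm version="0.6">
  <node id="1" lat="51.5155" lon="-0.0922">
    <tag k="addr:street" v="Cheapside"/>
    <tag k="amenity" v="cafe"/>
  </node>
  <node id="2" lat="51.5138" lon="-0.0984"/>
  <way id="10">
    <nd ref="1"/>
    <nd ref="2"/>
    <tag k="highway" v="residential"/>
    <tag k="name" v="Wood St"/>
  </way>
</osm>
"""


def count_tag_keys(osm_source):
    """Count occurrences of each <tag k="..."> key in an OSM XML source.

    `osm_source` may be a filename or a binary file-like object.
    """
    counts = Counter()
    # iterparse streams the file, so a large OSM extract is never
    # held in memory all at once.
    for _event, elem in ET.iterparse(osm_source, events=("end",)):
        if elem.tag == "tag":
            counts[elem.get("k")] += 1
        elif elem.tag in ("node", "way", "relation"):
            # Child tags were already counted; release the element's contents.
            elem.clear()
    return counts


key_counts = count_tag_keys(io.BytesIO(SAMPLE_OSM.encode("utf-8")))
for key, n in sorted(key_counts.items()):
    print(key, n)
```

Passing a path such as `sample_city_of_london.osm` instead of the in-memory buffer should work the same way, since `iterparse` accepts either a filename or a file object.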

The repository contains the following files:

  • final report.pdf: Final report containing the results of the case study.
  • audit_city_of_london.ipynb: Jupyter notebook containing the code used to audit the OpenStreetMap file, together with the audit results.
  • write_csv.ipynb: Jupyter notebook containing the code used to write the CSV file that was later imported into SQLite.
  • Map link.txt: Contains the link to the OpenStreetMap dataset used in the project.
  • sample_city_of_london.osm: A 10 MB sample of the 295 MB OpenStreetMap (OSM) file used in the project.
  • schema.py: Schema used to validate the data before writing it to a CSV file.
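The validate-then-write step described for schema.py and write_csv.ipynb could look roughly like the sketch below. The contents of schema.py are not shown here, so the `NODE_FIELDS` mapping, the field names, and the helper functions are all assumptions made for illustration, using only the standard library rather than whatever validation library the project relies on.

```python
import csv
import io

# Hypothetical schema: field name -> required Python type.
# The real schema.py may define different fields and rules.
NODE_FIELDS = {"id": int, "lat": float, "lon": float, "user": str}


def validate_row(row, fields=NODE_FIELDS):
    """Return the row unchanged if it matches the schema, else raise."""
    missing = set(fields) - set(row)
    if missing:
        raise ValueError(f"missing fields: {sorted(missing)}")
    for name, expected in fields.items():
        if not isinstance(row[name], expected):
            raise TypeError(
                f"field {name!r} should be {expected.__name__}, "
                f"got {type(row[name]).__name__}"
            )
    return row


def write_rows(rows, out_file):
    """Validate each row, then write it to out_file as CSV with a header."""
    writer = csv.DictWriter(out_file, fieldnames=list(NODE_FIELDS))
    writer.writeheader()
    for row in rows:
        writer.writerow(validate_row(row))


# Made-up example row; an in-memory buffer stands in for a real CSV file.
buffer = io.StringIO()
write_rows([{"id": 1, "lat": 51.5155, "lon": -0.0922, "user": "example_user"}], buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

Validating before writing means a malformed row stops the export immediately instead of surfacing later as a failed or silently wrong SQLite import.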