/WorldBankProjectsJsonExercise

Queries on World Bank projects JSON data

Primary language: Jupyter Notebook

This repo contains code that answers the following queries using the World Bank projects dataset ('dataset/world_bank_projects.json'):

  1. Find the 10 countries with the most projects.
  2. Find the top 10 major project themes (using column 'mjtheme_namecode').
  3. In item 2 above, some entries have only the code, and the name is missing. Create a dataframe with the missing names filled in.
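The three queries can be sketched on a tiny made-up sample, as below. This is not the notebook's actual code. It assumes each project record has a 'countryname' field (an assumption, since only 'mjtheme_namecode' is named above), that 'mjtheme_namecode' holds a list of {'code', 'name'} entries, and that a missing name is stored as an empty string or null. The country and theme names are placeholders.

```python
import pandas as pd

# Made-up sample in the ASSUMED shape of the dataset. The real notebook would
# load 'dataset/world_bank_projects.json' instead of this list.
records = [
    {"countryname": "Country A",
     "mjtheme_namecode": [{"code": "1", "name": "Theme X"},
                          {"code": "2", "name": ""}]},
    {"countryname": "Country A",
     "mjtheme_namecode": [{"code": "2", "name": "Theme Y"}]},
    {"countryname": "Country B",
     "mjtheme_namecode": [{"code": "1", "name": ""},
                          {"code": "1", "name": "Theme X"}]},
]
df = pd.DataFrame(records)

# Query 1: countries with the most projects (one row per project).
top_countries = df["countryname"].value_counts().head(10)

# One row per theme entry, built by unpacking the nested lists.
themes = pd.DataFrame([t for row in df["mjtheme_namecode"] for t in row])

# Query 3: build a code -> name lookup from the entries that do have a name,
# then use it to fill every row. A code that never appears with a name
# would end up as NaN.
has_name = themes["name"].fillna("").ne("")
lookup = themes[has_name].drop_duplicates("code").set_index("code")["name"]
themes_filled = themes.assign(name=themes["code"].map(lookup))

# Query 2: top themes, counted on the filled names so blanks are not a category.
top_themes = themes_filled["name"].value_counts().head(10)

print(top_countries)
print(top_themes)
```

Counting themes after filling the names, rather than before, keeps the empty string from showing up as a theme of its own in the top 10.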

The code is written in Python in a Jupyter notebook and covers the following key concepts:

• Reading and manipulating JSON data

• Data wrangling & cleaning

• Flattening JSON structure

• Usage of json_normalize

• Use of pandas library functions
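As a rough illustration of the flattening step, the sketch below uses `pandas.json_normalize` with `record_path` and `meta` to turn the nested theme list into one row per theme while keeping fields from the parent project. The 'id' and 'countryname' fields and all values are assumptions made for the example. The inline JSON string stands in for the dataset file.

```python
import json

import pandas as pd

# Inline stand-in for the dataset file. With the real file one would use:
#   with open('dataset/world_bank_projects.json') as f:
#       projects = json.load(f)
raw = """[
  {"id": "P1", "countryname": "Country A",
   "mjtheme_namecode": [{"code": "1", "name": "Theme X"},
                        {"code": "2", "name": ""}]},
  {"id": "P2", "countryname": "Country B",
   "mjtheme_namecode": [{"code": "1", "name": "Theme X"}]}
]"""
projects = json.loads(raw)

# record_path selects the nested list to expand into rows;
# meta lists parent-level fields to repeat on each expanded row.
flat = pd.json_normalize(
    projects,
    record_path="mjtheme_namecode",
    meta=["id", "countryname"],
)

print(flat)
```

Each project contributes as many rows as it has theme entries, so the result can be grouped or counted by 'code' directly.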