/ecuador-building-analysis-2012

Analysis of building data from Ecuador in 2012

Primary LanguageJupyter NotebookMIT LicenseMIT

Analysis of building data from Ecuador in 2012

Cotopaxi

The purpose of this project is to understand construction through data in 2012. This industry plays a key role in Ecuadorian economy, and that's why an EDA (Exploratory Data Analysis) is necessary in order to identyfy patterns.

The process I followed is extracting data, understanding and cleaning, descriptive statistics and make conclusions.

The analysis is divided into univariate, bivariate and multivariate analysis. Furthermore, I use visual (data visualization) and numerical (tables) resources.

Pandas is the library I used for data analysis. On the other hand, for data visualization I used Matplotlib and Seaborn.

Data

Data was extracted from: http://catalogo.datosabiertos.gob.ec/dataset/esta

This data is public and anyone can use it. INEC is the organization that collected the data.

Data is about construction permits in 2012. Thus, some of the buildings may have not been carried out.

How to read this repository

  • The folder data contains the raw data by INEC bdd-edificaciones-2012.csv and the cleanded data df_cleaned.csv.

  • The file data_cleaning.ipynb is where I cleaned the data, keeping relevant data only.

  • The file analysis.ipynb is where I performed the EDA. Here you can find the introduction, the analysis and conclusions and recommendations.


This is a personal project. The results presented here are objective and have no ulterior motives. You can use it as a reference for further research as long as you dont misinterpret the analysis and attribute copyright.

Feel free to connect with me or send me your feedback. I'd appreciate that. You can find me here: