The purpose of this project is to understand construction through data in 2012. This industry plays a key role in Ecuadorian economy, and that's why an EDA (Exploratory Data Analysis) is necessary in order to identyfy patterns.
The process I followed is extracting data, understanding and cleaning, descriptive statistics and make conclusions.
The analysis is divided into univariate, bivariate and multivariate analysis. Furthermore, I use visual (data visualization) and numerical (tables) resources.
Pandas is the library I used for data analysis. On the other hand, for data visualization I used Matplotlib and Seaborn.
Data was extracted from: http://catalogo.datosabiertos.gob.ec/dataset/esta
This data is public and anyone can use it. INEC is the organization that collected the data.
Data is about construction permits in 2012. Thus, some of the buildings may have not been carried out.
-
The folder data contains the raw data by INEC
bdd-edificaciones-2012.csv
and the cleanded datadf_cleaned.csv
. -
The file
data_cleaning.ipynb
is where I cleaned the data, keeping relevant data only. -
The file
analysis.ipynb
is where I performed the EDA. Here you can find the introduction, the analysis and conclusions and recommendations.
This is a personal project. The results presented here are objective and have no ulterior motives. You can use it as a reference for further research as long as you dont misinterpret the analysis and attribute copyright.
Feel free to connect with me or send me your feedback. I'd appreciate that. You can find me here:
- Twitter as @axlyaguana11
- LinkedIn: https://www.linkedin.com/in/axel-yaguana-cruz/