data-analysis-project

This project analyses the Airbnb dataset for Madrid.

The data is first sampled and explored in order to define and implement the data warehouse. The ETL data ingestion is performed using SQL.
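
As a rough illustration of this kind of SQL-based ETL step, the sketch below loads raw rows into a staging table and then populates a small dimension table and fact table. It is not the project's actual schema: the table names (`staging_listings`, `dim_neighbourhood`, `fact_listing`), the columns and the sample rows are all hypothetical, and an in-memory SQLite database driven from Python stands in for whatever database the project used.

```python
import sqlite3

# Hypothetical sample rows (id, neighbourhood, price as raw text).
# These are illustrative values, not data from the project.
RAW_ROWS = [
    ("1", "Centro", "85.0"),
    ("2", "Retiro", ""),       # missing price in the raw data
    ("3", "Centro", "120.5"),
]


def run_etl(rows):
    """Load raw rows into staging, then fill a dimension and a fact table."""
    conn = sqlite3.connect(":memory:")

    # Extract: land the raw text values unchanged in a staging table.
    conn.execute(
        "CREATE TABLE staging_listings (id TEXT, neighbourhood TEXT, price TEXT)"
    )
    conn.executemany("INSERT INTO staging_listings VALUES (?, ?, ?)", rows)

    # Target schema (assumed for this sketch): one dimension, one fact table.
    conn.execute(
        "CREATE TABLE dim_neighbourhood ("
        " neighbourhood_id INTEGER PRIMARY KEY,"
        " name TEXT UNIQUE)"
    )
    conn.execute(
        "CREATE TABLE fact_listing ("
        " listing_id INTEGER PRIMARY KEY,"
        " neighbourhood_id INTEGER REFERENCES dim_neighbourhood(neighbourhood_id),"
        " price REAL)"
    )

    # Transform and load: deduplicate neighbourhoods into the dimension.
    conn.execute(
        "INSERT INTO dim_neighbourhood (name)"
        " SELECT DISTINCT neighbourhood FROM staging_listings"
        " ORDER BY neighbourhood"
    )

    # Cast types and turn empty price strings into NULL while loading facts.
    conn.execute(
        "INSERT INTO fact_listing (listing_id, neighbourhood_id, price)"
        " SELECT CAST(s.id AS INTEGER), d.neighbourhood_id,"
        "        CAST(NULLIF(s.price, '') AS REAL)"
        " FROM staging_listings s"
        " JOIN dim_neighbourhood d ON d.name = s.neighbourhood"
    )
    conn.commit()
    return conn
```

Calling `run_etl(RAW_ROWS)` returns a connection that can then be queried, for example to aggregate `price` per neighbourhood by joining `fact_listing` to `dim_neighbourhood`.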

An exploratory statistical analysis of the data is carried out in R. It reviews the quality of the data, detects outliers and normalises the data.
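
To make the outlier detection and normalisation steps more concrete, here is a minimal sketch. The project performs this analysis in R; Python is used here only for illustration. The specific techniques are assumptions, since the project's exact methods are not stated: outliers are flagged with the common 1.5 × IQR rule, and values are rescaled with min-max normalisation. The function names are hypothetical.

```python
import statistics


def iqr_bounds(values, k=1.5):
    """Return the (low, high) fences of the k * IQR rule."""
    # The "inclusive" method treats the data as the full population
    # and interpolates between neighbouring data points.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def remove_outliers(values, k=1.5):
    """Keep only the values that fall inside the IQR fences."""
    low, high = iqr_bounds(values, k)
    return [v for v in values if low <= v <= high]


def min_max_normalise(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]
```

A typical use would be to pass a numeric column, such as nightly prices, through `remove_outliers` first and then through `min_max_normalise`, so that a few extreme values do not compress the rest of the scale.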

Tableau is used to visualise the metrics, which makes it possible to get the most out of the information and present it clearly.

The Tableau workbook exploration.twbx contains the calculations and interactive visualisations used to evaluate the KPIs.

A linear regression model is fitted and its suitability for the data is assessed.
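
The sketch below shows one way to fit a simple linear regression and judge how well it fits. It is an illustration only: it uses ordinary least squares with a single predictor and the coefficient of determination (R²) as the suitability measure, which may differ from the variables, tooling and diagnostics used in the project. The function names are hypothetical.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and of co-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(xs, ys, slope, intercept):
    """Share of the variance in y explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot
```

An R² close to 1 means the line explains most of the variation in the target, while a value close to 0 suggests that a linear model is a poor fit. In practice, residual plots are usually inspected as well, since R² alone can hide non-linear patterns.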

The file report.pdf contains the report and the conclusions reached.

Other tools used during development were git for version control, taiga.io for the Scrum board and Discord for group video calls in all sessions.

A presentation of the project is included in the file presentation.pdf.