Welcome to your new dbt project!
Try running the following commands:
- dbt run
- dbt test
-
Learn more about dbt in the docs
-
Check out Discourse for commonly asked questions and answers
-
Join the chat on Slack for live discussions and support
-
Find dbt events near you
-
Check out the blog for the latest news on dbt's development and best practices
Welcome to my Analytics Engineering project repository! In this project, I leveraged dbt (data build tool) and Snowflake to conduct a detailed analysis of Airbnb data, exploring the impact of lunar cycles on guest reviews. Let me take you through the key components and insights gained from this project.
The goal of this project was to analyze Airbnb datasets (raw_listings
, raw_reviews
, raw_hosts
) using dbt and Snowflake to understand whether guest mood, as reflected in reviews, correlates with the occurrence of a full moon.
- Transformed raw data into structured views (
src_listings
,src_reviews
,src_hosts
) for initial cleansing and renaming. - Utilized different materialization methods (table, view, incremental table, ephemeral table) for efficient data processing.
- Developed dimension tables (
dim_listings_cleansed
,dim_hosts_cleansed
,dim_listings_w_hosts
) using SQL queries within dbt models. - Organized and transformed data into meaningful dimensions and facts to support analysis.
- Explored the hypothesis with
mart_fullmoon_reviews
to analyze guest reviews around full moon dates. - Linked review dates with full moon dates to investigate potential correlations.
- Implemented YAML configuration (
sources.yml
) for managing data sources. - Created custom tests (
dim_listings_minimum_nights.sql
,consistent_created_at.sql
) and macros (no_nulls_in_columns.sql
,positive_value.sql
) to ensure data quality and consistency. - Leveraged snapshotting to track changes at the source level (
raw_listings
,raw_hosts
).
- Connected with Preset for BI visualization, enabling the creation of interactive charts and visualizations based on the
mart_fullmoon_reviews
dataset.
Through this project, I aimed to prove or reject the hypothesis that a full moon affects guest mood, as reflected in their reviews. The analysis conducted provides valuable insights into the relationship between lunar cycles and guest experiences on Airbnb.
To replicate or further explore this project:
- Ensure you have dbt and Snowflake set up.
- Clone this repository and navigate to the project directory.
- Run dbt commands to execute the models, tests, and generate documentation.
This project demonstrates the power of dbt and Snowflake in handling large-scale datasets and conducting sophisticated analytics. I'm excited to share my learnings and insights with the community and welcome any feedback or collaboration opportunities.