In this project I created and orchestrated a data pipeline to analyze the IMDB movie data.
The data pipeline was created using the following tools:
- Data ingestion: Web scraping from IMDB using Python
- Data storage: Google BigQuery
- Data analysis: DBT
- Data visualization: Power BI
- Data orchestration: Apache Airflow
- Container deployment: Docker
https://medium.com/@bdadon50/data-engineering-project-imdb-movie-analysis-3f79de2f4ce7