- This project models and builds a data warehouse for the Sakila DB, based on a star schema.
- The project is written in Python; specifically, ipython-sql is used to create the tables and insert data into them.
- The relational Sakila DB consists of multiple entities. This design works well for transactional purposes, but ad-hoc queries against it take a long time, so a DW is needed for analytical purposes.
- The DW has one fact table, which contains the factual data, and four dimension tables, which contain all other attributes from the relational database.
- A data warehouse does not try to minimize data redundancy. Instead, it aims for faster ad-hoc queries by replicating information across tables, so that each query needs fewer joins.
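
As a rough illustration of the star schema idea described above, here is a minimal sketch with one fact table and two of the dimension tables. It uses Python's built-in `sqlite3` module with an in-memory database so it runs on its own; the project itself uses ipython-sql. The table names (`dim_date`, `dim_movie`, `fact_sales`), their columns, and the sample rows are assumptions made up for this example, not the project's actual schema.

```python
import sqlite3

# In-memory database, used here only so the sketch is self-contained.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical star schema: two dimension tables and one fact table
# that references them through surrogate keys.
cur.executescript("""
CREATE TABLE dim_date (
    date_key INTEGER PRIMARY KEY,
    year     INTEGER NOT NULL,
    month    INTEGER NOT NULL
);
CREATE TABLE dim_movie (
    movie_key INTEGER PRIMARY KEY,
    title     TEXT NOT NULL,
    rating    TEXT
);
CREATE TABLE fact_sales (
    sales_key    INTEGER PRIMARY KEY,
    date_key     INTEGER NOT NULL REFERENCES dim_date(date_key),
    movie_key    INTEGER NOT NULL REFERENCES dim_movie(movie_key),
    sales_amount REAL NOT NULL
);
""")

# Made-up sample rows, for illustration only.
cur.executemany(
    "INSERT INTO dim_date VALUES (?, ?, ?)",
    [(20050524, 2005, 5), (20050615, 2005, 6)],
)
cur.executemany(
    "INSERT INTO dim_movie VALUES (?, ?, ?)",
    [(1, "ACADEMY DINOSAUR", "PG"), (2, "ACE GOLDFINGER", "G")],
)
cur.executemany(
    "INSERT INTO fact_sales (date_key, movie_key, sales_amount) VALUES (?, ?, ?)",
    [(20050524, 1, 2.99), (20050524, 2, 4.99), (20050615, 1, 0.99)],
)
conn.commit()

# Ad-hoc query: revenue per month and movie.
# Each dimension is exactly one join away from the fact table.
monthly_revenue = cur.execute("""
SELECT d.month, m.title, SUM(f.sales_amount) AS revenue
FROM fact_sales AS f
JOIN dim_date  AS d ON f.date_key  = d.date_key
JOIN dim_movie AS m ON f.movie_key = m.movie_key
GROUP BY d.month, m.title
ORDER BY d.month, m.title
""").fetchall()

for month, title, revenue in monthly_revenue:
    print(month, title, revenue)
```

In a normalized transactional schema, the same question would typically require walking through several intermediate tables. Here the descriptive attributes sit directly in the dimension tables, which is the trade of extra redundancy for fewer joins.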