/MsongDB

Primary LanguagePython

Project ETL milion song dataset with aws

Requirements:

  • Display multiple (3-4) dashboards (different pages), each dashboard contains 4-5 charts, boxes showing a certain number....-> Design UI (choosing some types of charts including filtering, multi-level filtering, loading static data; a page contains multiple buttons that control different options of displaying data), Design data schema. input: X-> output: Y, Embed to a web?

  • ETL: Extraction pipelines, transformation pipelines. Choose Compression types (snappy, lzo, gzip...)-> Design data schema for silver, gold layers, database (notice matching between the DB and BI dashboards); Code keyword: medal model (bronze, silver, gold)