/ETL-Movies

Setting up a Movies Database by ETL for a Hackathon

Primary LanguageJupyter NotebookMIT LicenseMIT

Extract Tranform Load (ETL) Movies

Utilize ETL (Extract Tranform Load) process to create data pipelines about Movies, which moves data from source to destination pathing, and creates data pipelines that transform data.

Some of the tasks set up for this project:

  • Perform ETL pipeline from raw data to a SQL database
  • Extract and Transform data from disparate sources, using Python libraries such as Pandas
  • Create regular expressions to parse data and to transform text into numbers
  • Load data with PostgreSQL