/udacity-data-eng-proj4

An ETL pipeline for a data lake: it extracts data from S3, processes it with Spark, and loads the results back into S3 as a set of dimensional tables. Lake processing: Spark. Lake storage: S3.
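
The "set of dimensional tables" step can be illustrated with a minimal sketch. This is not the repository's code: it uses plain Python as a stand-in for Spark so it runs without a cluster, and the function name, field names, and sample records are all hypothetical.

```python
# Plain-Python stand-in for the Spark transform step.
# Every name and record below is hypothetical, chosen only for illustration.
from typing import Dict, List, Tuple

Record = Dict[str, object]


def build_dimensional_tables(events: List[Record]) -> Tuple[List[Record], List[Record]]:
    """Split flat event records into a user dimension table and a fact table."""
    users: Dict[object, Record] = {}
    facts: List[Record] = []
    for event in events:
        # Dimension: one row per user_id; a later record overwrites an earlier one.
        users[event["user_id"]] = {"user_id": event["user_id"], "name": event["name"]}
        # Fact: one row per event, holding only the key that points at the dimension.
        facts.append(
            {"event_id": event["event_id"], "user_id": event["user_id"], "item": event["item"]}
        )
    return list(users.values()), facts


# Hypothetical flat input, as it might look after the extract step.
raw_events = [
    {"event_id": 1, "user_id": 10, "name": "Ada", "item": "a"},
    {"event_id": 2, "user_id": 10, "name": "Ada", "item": "b"},
    {"event_id": 3, "user_id": 11, "name": "Lin", "item": "a"},
]
user_dim, event_fact = build_dimensional_tables(raw_events)
```

In Spark, the same split would typically be expressed as a `select` followed by `dropDuplicates` on a DataFrame, with each resulting table written back to S3.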

Primary Language: Jupyter Notebook
