shravan-kuchkula/udacity-data-eng-proj4
Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as a set of dimensional tables. Lake Processing: Spark, Lake Storage: S3
Jupyter Notebook