/US-Immigration-Data-Lake

Udacity Data Engineering Nanodegree capstone project covering most aspects of data engineering: data exploration, data cleaning, data modeling, ELT (Extract, Load, Transform), data processing on AWS using Apache Spark, and automating data pipelines with Apache Airflow.

Primary Language: Jupyter Notebook
