This repository contains code that is part of the Udacity Data Engineer Nanodegree program.
See the projects listed below for details.
Data Pipeline Analytics Platform is an end-to-end, generic Big Data pipeline. Its tech stack comprises AWS S3, AWS Redshift, an AWS EMR cluster, Apache Spark, and Apache Airflow.
Language: Python; License: GPL-3.0