Udacity - Nano Degree : Data Engineering
This is a highly intensive learning program - atleast for me
Core Curriculum
-
Data Wrangling ( Completed on Sept 12, 2019 )
- Project 1 : Wrangle and Analyze Data
Technicals : Python, Juypter, Twitter API
- Project 1 : Wrangle and Analyze Data
-
Data Modeling
-
Project 2 : Data Modeling with Postgres ( Completed on Sept 18, 2019 )
Technicals : Python, Juypter, Postgres(psycopg2) -
Project 3 : Data Modeling with Apache Cassandra ( Completed on Sept 22, 2019 )
Technicals : Python, Juypter, Casandra(psycopg2)
-
-
Cloud Data Warehouses ( Completed on October 17, 2019 )
- Project 4 : Data Warehouse
Technicals : Python, Juypter, Geopy, AWS(S3, Redshift, IAM)
- Project 4 : Data Warehouse
-
Data Lakes with Spark ( Completed on November 5, 2019 )
- Project 5 : Data Lake
Technicals : Python, Jupyter, Spark, AWS(EMR, S3, EMR Notebooks, EC2, Athena)
- Project 5 : Data Lake
-
Data Pipelines with Airflow ( Completed on December 19, 2019 )
- Project 6 : Data Pipelines
Technicals : Python, Redshift, Airflow, S3
- Project 6 : Data Pipelines
-
Capstone Project ( Completed on February 23, 2020 )
- Project 7 : Data Engineering Capstone Project
Technicals : Python, Redshift, S3, Pyspark
- Project 7 : Data Engineering Capstone Project
Udacity Data Engineering Graduate Certificate
Who are Data Engineers ?
They are software professionals who can collect, assess, design, build, clean and intergrate data from various sources a.k.a people who are highly skilled, experienced, able to work in pressurized workplace, sportive to see lots of errors, able to find workaround for problems, capable of redesigning the entire system because they have found a efficient way to do the same thing, keep continously reskilling themselves, self-managed/self-disciplined , agile by nature, addicted programmers basically people who dream about vacations which they always miss.