/Udacity-Nano_Degree_Data_Engineering

My Udacity Data Engineer Nano Degree Projects aka Udacity DEND

Primary LanguageJupyter Notebook

Udacity - Nano Degree : Data Engineering

This is a highly intensive learning program - atleast for me

Core Curriculum

  1. Data Wrangling ( Completed on Sept 12, 2019 )

    • Project 1 : Wrangle and Analyze Data
      Technicals : Python, Juypter, Twitter API
  2. Data Modeling

    • Project 2 : Data Modeling with Postgres ( Completed on Sept 18, 2019 )
      Technicals : Python, Juypter, Postgres(psycopg2)

    • Project 3 : Data Modeling with Apache Cassandra ( Completed on Sept 22, 2019 )
      Technicals : Python, Juypter, Casandra(psycopg2)

  3. Cloud Data Warehouses ( Completed on October 17, 2019 )

    • Project 4 : Data Warehouse
      Technicals : Python, Juypter, Geopy, AWS(S3, Redshift, IAM)
  4. Data Lakes with Spark ( Completed on November 5, 2019 )

    • Project 5 : Data Lake
      Technicals : Python, Jupyter, Spark, AWS(EMR, S3, EMR Notebooks, EC2, Athena)
  5. Data Pipelines with Airflow ( Completed on December 19, 2019 )

    • Project 6 : Data Pipelines
      Technicals : Python, Redshift, Airflow, S3
  6. Capstone Project ( Completed on February 23, 2020 )

    • Project 7 : Data Engineering Capstone Project
      Technicals : Python, Redshift, S3, Pyspark

Udacity Data Engineering Graduate Certificate

Udacity Data Engineering Nano Degree Certificate



Who are Data Engineers ?

They are software professionals who can collect, assess, design, build, clean and intergrate data from various sources a.k.a people who are highly skilled, experienced, able to work in pressurized workplace, sportive to see lots of errors, able to find workaround for problems, capable of redesigning the entire system because they have found a efficient way to do the same thing, keep continously reskilling themselves, self-managed/self-disciplined , agile by nature, addicted programmers basically people who dream about vacations which they always miss.