/StartDataEngineering_101

Sample repo for startdataengineering DE 101 free course

Start Data Engineering DE 101 Free Course

Course details

This is a self-paced course (sign up here), you will receive an email every 3 days. Signing up will make you eligible for direct feedback from me! You will learn the following:

Basics

  1. What do data engineers do?
  2. SQL and advanced SQL for data engineers
  3. Python essentials for data engineers
  4. Data pipeline best practices

Project 1

  1. Build a data pipeline with dbt

Intermediate

  1. How to test your data pipelines and how testing is different from data quality checks
  2. Coding patterns for data pipeline
  3. What are schedulers and orchestrators

Capstone project

  1. How to design a project to impress a hiring manager
  2. Creating your own data projects
  3. Scaling your data pipelines

Job search and beyond

  1. How to land a high paying data job and whats next

Prerequisites

Here is what you will need to do this course:

  1. GitHub account
  2. Machine (laptop/desktop) with Python installed

Some lessons will require you to install software as needed.

Assignments

Each lesson has an assignment. Please create a GitHub repository (name: sde_de101_yourname) and create individual folders for each lesson. Please use the repository you create to share your assignments with me for feedback.

  1. How to create a GitHub repository
  2. Sample GitHub repository