-
A similar book like DataScience101 which is a personal note to be a better machine learning engineer.
-
50+
notes so far, continously updating.
data_engineering_brief_intro_by_google
awesome data engineering tools
Emerging Architectures for Modern Data Infrastructure 2022
Designing Data-Intensive Applications
Youtube - hands on data engineering on gcp
Google - data engineering course
Datacomp - Data Engineer with Python
udemy - Data Engineering on Google Cloud platform
CS246 Mining Massive datasets - Notes
Note of CS246 Mining Massive Datasets
Note of CS329S Machine Learning System Design
OLTP vs OLAP (database vs data warehouse)
Dimentional Modeling (Star Schema)
date warehouse, datalake, datamesh and other buzzyword
Computational Framework Survey
structured streaming introduction
case study - near realtime arct for recommender in LinkedIn
realtime mvp for recommendation from Chip Huyen
data piepline 101 - I - mirroring
data piepline 101 - II - partition mirroring
data piepline 101 - II - accumulated mirroring
data piepline 101 - III - etl, elt
data piepline 101 - IV - pipeline design - functionality
data piepline 101 - V - Idempotency
data piepline 101 - VI - Guard
data piepline 101 - VII - Checkpoint, Security, Accounts
data pipeline 101 - IIX - etl development
Google Kubernetes Engine Introduction
Google Kubernetes Getting Start
Google Kubernetes Engine Introduction
Google Kubernetes Getting Start
Kubernetes for the Absolute Beginners - Hands-on
MySQL install and python connector
database wrapper sqlalchemy, pymysql, pyodbc
Primary Key, Index and Partition