/data-engineering-reading-list

Suggested reading materials for software engineers, data engineers, and data scientists.

MIT License

data-engineering-reading-list

Overview

Why does this repo exist?

For the following reasons:

  • People keep asking me for data engineering (DE) training resources.
  • People ask me which online courses I take.
  • People ask me which books I read.

In my opinion, paid resources are a better use of your time than free content: on average, I find the return on time invested to be higher.

This is a curated list, not a comprehensive one. Treat it as a good starting point.

Online Platforms

YouTube Channels

Online Courses

Here is a spreadsheet of all the online courses I have taken.

Tooling

I suggest that you learn the following programming languages and tools. They are not listed in any particular order.

Basic

Advanced

More Advanced Options

Books

Here are some books that might be useful.




Other Resources

Terraform

Microservices

  • Building Microservices by Sam Newman
  • Monolith to Microservices by Sam Newman
  • Enterprise Integration Patterns
  • Domain-Driven Design
  • Domain-Driven Design Distilled
  • The Enterprise Big Data Lake by Alex Gorelik
  • Building Evolutionary Architectures by Neal Ford
  • Software Architecture: The Hard Parts by Neal Ford
  • Kafka: The Definitive Guide
  • Designing Distributed Systems
  • Building Event-Driven Microservices

Architecture

Python

Golang

  • TBD

Whitepapers

Datasets

Frameworks

  • Axon Framework
  • Spring Cloud
  • FastAPI