The Breaking Into Data Handbook

In this repo you will find valuable resources to get you started in
Data Analytics, Data Science, Data Engineering, Machine Learning and Computer Science.

This is an open-source effort.
Please add any links you have found helpful with a PR!

P.S. Don't be overwhelmed.
Find what works for you.
And stick to it every day!

Here you will find:

  • Courses
  • Books
  • Communities
  • Hackathons
  • Projects
  • Content Creators to follow
  • Podcasts
  • Newsletters

Courses

Free:

Kaggle Courses
Deep Learning with fast.ai
LeetCode Challenges
Weights and Biases deployment
LangChain LLM development
Harvard Online Data Science Courses
Coursera Data Science Courses
Alex Freberg's Data Analyst Boot Camp
Python for Everybody

Paid:

Analyst Builder
Codecademy
DataCamp
Data With Danny

Books

Ace The Data Science Interview
Building a Second Brain - an excellent guide to productivity
Fundamentals of Data Engineering by Joe Reis & Matt Housley
Designing Machine Learning Systems by Chip Huyen
Naked Statistics
Automate the Boring Stuff with Python (Free!)
The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd Edition
Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Learning Spark, Second Edition

Communities

Break Into Data
Cohere Community
Roboflow Universe Community for Computer Vision ML
Chip Huyen's MLOps Community
DataTalks Club
AICamp

Hackathons

Hackathons hosted by Lablab
DevPost Hackathons

Free Projects

Kaggle Datasets
Project Pro
DataCamp Projects

Content Creators

LinkedIn:

Meri Nova - Data Science
Daliana Liu - Data Science
Alex Freberg - Data Analytics
Jess Ramos - Data Analytics
Megan Lieu - Data Analytics
Danny Ma - SQL
Vin Vashishta - AI
Nick Singh - SQL & Interviews
Sundas Khalid - Data Science

YouTube:

Alex the Analyst
Charlotte Fraza - Computational Neuroscience
ByteByteGo - System Design
Ken Jee
Tina Huang
StatQuest by Josh Starmer

TikTok:

Alex Freberg
Charlotte Chaze

Twitter:

Meri Nova
Alex Freberg
Daliana Liu
Vin Vashishta
Nick Singh

Podcasts

Gradient Dissent by W&B
DataFramed Podcast
Towards Data Science Podcast
Practical AI
Chai Time Data Science
The Data Scientist Show
AI Chronicles

Newsletters

Break Into Data
Towards Data Science
ByteByteGo System Design Newsletter
Data Analysis Journal by Olga
Marvelous MLOps Newsletter
Ahead of AI by Sebastian Raschka
Underfitted by Santiago
Seattle Data Guy Substack
DeepLearning.AI