This cheatsheet is currently a 9-page data science reference that covers basic concepts in probability, statistics, statistical learning, machine learning, big data frameworks, and SQL.
The cheatsheet is loosely based on The Data Science Design Manual by Steven S. Skiena and An Introduction to Statistical Learning by Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani.
It was inspired by William Chen's The Only Probability Cheatsheet You'll Ever Need.
- Graph Theory
- Algorithms and Data Structures
- Python
- Advanced SQL (SQL Part II)
- Data Science on the Cloud (AWS/GCP/Azure)
- Linear Algebra
- Data Engineering
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
2018-08-13 Added Python Data Structures Section
2018-08-12 Added Feature Engineering Section
2018-08-10 Added Data Science Cheat Sheet
Feel free to send comments, updates, and suggestions for improvement!
Maverick Lin: Reach out to me via Quora or through my website. Cheers.