/datasci

Self-study plan to achieve mastery in data science

Zero to Mastery in Data Science.

Study plan overview

  • Module 0 - Elementary to Highschool Math
  • Module 1 - College Math I (Calculus)
  • Module 2 - College Math II (Linear Algebra)
  • Module 3 - College Math III (Discrete Math)
  • Module 4 - College Math IV (Probability and Statistics)
  • Module 5 - Computation and Algorithms
  • Module 6 - Artificial Intelligence and Machine Learning
  • Module 7 - Deep Learning
  • Module 8 - Data Mining and Recommenders
  • Module 9 - NLP and Computer Vision

Module 0 - Elementary to Highschool math

Not everyone was fortunate enough to have a good start with math growing up. The goal of this module is to level the playing field - by the end of module 0 you should feel as though you went to a highschool with world class teachers and finished top of your math class.

If you consider yourself bad at math, or if you "hated math" in school, then the best advice is to start at the lowest level you can. Start at pre-school math if you have to, but find the level of math where you can easily follow. Resist skipping ahead and go through the program level by level. Do not advance to the next level until you have mastery of the current level. If the current level is too hard, go back to an earlier level. I've linked some courses here that are probably a good for most, but you can find even more elementary courses on khanacademy if you need.

Algebra

Geometry

Pre Calculus

Statistics and Probability

Supplementary Material

Module 1 - College Math I (Calculus)

Supplementary Material

Module 2 - College Math II (Linear Algebra)

Required Reading

Supplementary Material

Module 3 - College Math III (Discrete Math)

3.1 Proofs and Logic

Proofs, Set theory, propositional logic, induction, invariants, state-machines

3.2 Number Theory

Number theory is fundamental in reasoning about numbers as discrete mathematic structures with applications in cryptography and efficient numerical computation.

By the end of this sub-module you should be very confident proving and reasoning about concepts including: divisibility, bezouts identity, modular arithmetic, eulers totient theorem, fermats little theorem, integer factorization, diophantine equations, the fundemental theorem of arithmetic, chinese remainder theorem, RSA and the discrete logarithm problem.

Problem Sets

Optional Supplementary Material

3.3 Combinatorics

Combinatorics is a vital skill in reasoning about the size of finite sets.

Problem Sets

3.4 Graph Theory

3.5 Series, Sequences, Recurrences

todo

3.6 Discrete Probability

todo

Discrete Math Supplementary Material

Module 4 - College Math IV (Probability and Statistics)

Module 5 - Computation and Algorithms

Algorithms

Resources

Information Theory

Python and Computation and Data

Module 5.5 - Databases, and Computer Architecture

Supplementary

Module 6 - Artificial Intelligence and Machine Learning

https://www.coursera.org/specializations/aml

Artificial Intelligence

Machine Learning

Machine Learning Specialization by University of Washington on Coursera

Module 7 - Deep Learning

Deep Learning by deeplearning.ai on Coursera

Goals:

  • different activation functions (sigmoid/tanh/relu)
  • different cost functions
  • with and without bias units
  • classification and regression problems
  • text / binary / image / recommenders
  • batch vs stochastic
  • JS, Python, PHP, Matlab, TensorFlow, SciKitLearn
  • create visualizations and blog explanations
  • Audit best courses / books

Module 8 - Data Mining and Recommenders

Module 9 - NLP and Computer Vision

NLP

Image and Computer Vision

Electives

Resources

Reading List