/ML-Interview

Resources I used for ML Engineer, Applied Scientist and Quant Researcher interviews.

ML-Interview

This is a list of resources I found useful during my preparation for interviews. Broadly speaking, I interviewd for three different profiles: Machine Learning Engineer, Applied Scientist and Quantitative Researcher.

NOTE: All these profiles usually include multiple "traditional" programming/algorithm rounds, and for that, I relied upon mild leetcoding spree, spread over a period of 3 months.

Classical Machine Learning

  • A very good (slighly advanced) course on Machine Learning by Alex Smola. Link
  • Perhaps everything that you'll ever need to know for the interview sake. Link
  • Generative vs Discriminative Classifiers (you should know the difference, and tradeoffs when choosing one over the other) Link
  • Gradient Boosted Trees Link
  • Gentle Introduction to Gradient Boosting Link
  • ROC and AUC (I like this video) Link
  • Clustering (from Ryan Tibshirani's Data Mining course, other slides are really good as well) Link 1 Link 2 Link 3
  • Good old Linear Regression. Link
  • L0, L1 and L2 regularization (Subset Selection, Lasso and Ridge Regression), a comparison. The Elements of Statistical Learning, Trevor Hastie, Robert Tibshirani, Jerome Friedman, 2nd Edition, Section 3.4.3 Link

Deep Learning

  • Why tanh for Recurrent Networks Link
  • Receptive Fields in CNNs Link
  • For everything Convolution Link
  • For eveything Gradient Descent Link
  • Adaptive Learning rates in SGD Link
  • Backpropagation in Python, Andrej Karpathy Link

Probability and Statistics

  • As the title would say, "Generalized Linear Models, abridged".Link
  • A good course to cover Statistics Link
  • Basic Statistics: Introduction to Mathematical Statistics, Hogg, McKean and Craig, Chapters 1-4. Link
  • Introduction to Hypothesis Testing: Introduction to Mathematical Statistics, Hogg, McKean and Craig, Section 4.5-4.6 Link
  • Examples of Uncorrelated vs Independent Random Variable Link
  • Discrete time Markov Chains,Poisson Processes, Renewal Theory Adventures in Stochastic Processes, 2nd Edition, Sidney Resnick Link TODO: Add a link to more succint notes.
  • Q-Q Plots Link

Large Scale Machine Learning

Assorted Mathematics

  • Some facts about Symmetric Matrices. Link
  • Bare minimum SVD by Gilbert Strang. Link
  • SVD and PCA in real-life. Link
  • If you are not sure how SVD and PCA are related. Link
  • If you want to brush up on Chain Rule (or if you are like me and get confused between gradient and derivative notation) Link. Wikipedia has some useful information as well.
  • Collection of Quantitative Interview problems by Pete Benson, University of Michigan. Link
  • Cholesky Factorization Link
  • QR Factorization Link

System Design (for ML)

  • Structure for Google Index Link
  • Recommender Systems, Xavier Amatriain Link 1 Link 2
  • News Feed Ranking @ Facebook (Lars Backstrom) Link

Uncategorized

  • Sobel Operator Link
  • You have a fair Die, and you can choose to roll it up to 3 times. Whenever you decide to stop, the number that’s facing up is your score. What strategy would you choose to maximize your score? What is the expected score with your strategy? If you are given more than 3 chances, can you improve your score?