Reference Materials for System Design Interview

Chapter 1: Introduction and Overview

Chapter 2: Visual Search System

Chapter 3: Google Street View Blurring System

[1] Google Street View. https://www.google.com/streetview.
[2] DETR. https://github.com/facebookresearch/detr.
[3] RCNN family. https://lilianweng.github.io/posts/2017-12-31-object-recognition-part-3.
[4] Fast R-CNN paper. https://arxiv.org/pdf/1504.08083.pdf.
[5] Faster R-CNN paper. https://arxiv.org/pdf/1506.01497.pdf.
[6] YOLO family. https://pyimagesearch.com/2022/04/04/introduction-to-the-yolo-family.
[7] SSD. https://jonathan-hui.medium.com/ssd-object-detection-single-shot-multibox-detector-for-real-time-processing-9bd8deac0e06.
[8] Data augmentation techniques. https://www.kaggle.com/getting-started/190280.
[9] CNN. https://en.wikipedia.org/wiki/Convolutional_neural_network.
[10] Object detection details. https://dudeperf3ct.github.io/object/detection/2019/01/07/Mystery-of-Object-Detection.
[11] Forward pass and backward pass. https://www.youtube.com/watch?v=qzPQ8cEsVK8.
[12] MSE. https://en.wikipedia.org/wiki/Mean_squared_error.
[13] Log loss. https://en.wikipedia.org/wiki/Cross_entropy.
[14] Pascal VOC. http://host.robots.ox.ac.uk/pascal/VOC/voc2008/index.html.
[15] COCO dataset evaluation. https://cocodataset.org/\#detection-eval.
[16] Object detection evaluation. https://github.com/rafaelpadilla/Object-Detection-Metrics.
[17] NMS. https://en.wikipedia.org/wiki/NMS.
[18] Pytorch implementation of NMS. https://learnopencv.com/non-maximum-suppression-theory-and-implementation-in-pytorch/.
[19] Recent object detection models. https://viso.ai/deep-learning/object-detection/.
[20] Distributed training in Tensorflow. https://www.tensorflow.org/guide/distributed_training.
[21] Distributed training in Pytorch. https://pytorch.org/tutorials/beginner/dist_overview.html.
[22] GDPR and ML. https://www.oreilly.com/radar/how-will-the-gdpr-impact-machine-learning.
[23] Bias and fairness in face detection. http://sibgrapi.sid.inpe.br/col/sid.inpe.br/sibgrapi/2021/09.04.19.00/doc/103.pdf.
[24] AI fairness. https://www.kaggle.com/code/alexisbcook/ai-fairness.
[25] Continual learning. https://towardsdatascience.com/how-to-apply-continual-learning-to-your-machine-learning-models-4754adcd7f7f.
[26] Active learning. https://en.wikipedia.org/wiki/Active_learning_(machine_learning).
[27] Human-in-the-loop ML. https://arxiv.org/pdf/2108.00941.pdf.

Chapter 4: YouTube Video Search

Chapter 5: Harmful Content Detection

Chapter 6: Video Recommendation System

[1] YouTube recommendation system. https://blog.youtube/inside-youtube/on-youtubes-recommendation-system.
[2] DNN for YouTube recommendation. https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45530.pdf.
[3] CBOW paper. https://arxiv.org/pdf/1301.3781.pdf.
[4] BERT paper. https://arxiv.org/pdf/1810.04805.pdf.
[5] Matrix factorization. https://developers.google.com/machine-learning/recommendation/collaborative/matrix.
[6] Stochastic gradient descent. https://en.wikipedia.org/wiki/Stochastic_gradient_descent.
[7] WALS optimization. https://fairyonice.github.io/Learn-about-collaborative-filtering-and-weighted-alternating-least-square-with-tensorflow.html.
[8] Instagram multi-stage recommendation system. https://ai.facebook.com/blog/powered-by-ai-instagrams-explore-recommender-system/.
[9] Exploration and exploitation trade-offs. https://en.wikipedia.org/wiki/Multi-armed_bandit.
[10] Bias in AI and recommendation systems. https://www.searchenginejournal.com/biases-search-recommender-systems/339319/#close.
[11] Ethical concerns in recommendation systems. https://link.springer.com/article/10.1007/s00146-020-00950-y.
[12] Seasonality in recommendation systems. https://www.computer.org/csdl/proceedings-article/big-data/2019/09005954/1hJsfgT0qL6.
[13] A multitask ranking system. https://daiwk.github.io/assets/youtube-multitask.pdf.
[14] Benefit from a negative feedback. https://arxiv.org/abs/1607.04228?context=cs.

Chapter 7: Event Recommendation System

[1] Learning to rank methods. https://livebook.manning.com/book/practical-recommender-systems/chapter-13/53.
[2] RankNet paper. https://icml.cc/2015/wp-content/uploads/2015/06/icml_ranking.pdf.
[3] LambdaRank paper. https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/lambdarank.pdf.
[4] LambdaMART paper. https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/MSR-TR-2010-82.pdf.
[5] SoftRank paper. https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/SoftRankWsdm08Submitted.pdf.
[6] ListNet paper. https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/tr-2007-40.pdf.
[7] AdaRank paper. https://dl.acm.org/doi/10.1145/1277741.1277809.
[8] Batch processing vs stream processing. https://www.confluent.io/learn/batch-vs-real-time-data-processing/#:~:text=Batch%20processing%20is%20when%20the,data%20flows%20through%20a%20system.
[9] Leveraging location data in ML systems. https://towardsdatascience.com/leveraging-geolocation-data-for-machine-learning-essential-techniques-192ce3a969bc#:~:text=Location%20data%20is%20an%20important,based%20on%20your%20customer%20data.
[10] Logistic regression. https://www.youtube.com/watch?v=yIYKR4sgzI8.
[11] Decision tree. https://careerfoundry.com/en/blog/data-analytics/what-is-a-decision-tree/.
[12] Random forests. https://en.wikipedia.org/wiki/Random_forest.
[13] Bias/variance trade-off. http://www.cs.cornell.edu/courses/cs578/2005fa/CS578.bagging.boosting.lecture.pdf.
[14] AdaBoost. https://en.wikipedia.org/wiki/AdaBoost.
[15] XGBoost. https://xgboost.readthedocs.io/en/stable/.
[16] Gradient boosting. https://machinelearningmastery.com/gentle-introduction-gradient-boosting-algorithm-machine-learning/.
[17] XGBoost in Kaggle competitions. https://www.kaggle.com/getting-started/145362.
[18] GBDT. https://blog.paperspace.com/gradient-boosting-for-classification/
[19] An introduction to GBDT. https://www.machinelearningplus.com/machine-learning/an-introduction-to-gradient-boosting-decision-trees/.
[20] Introduction to neural networks. https://www.youtube.com/watch?v=0twSSFZN9Mc. [21] Bias issues and solutions in recommendation systems. https://www.youtube.com/watch?v=pPq9iyGIZZ8.
[22] Feature crossing to encode non-linearity. https://developers.google.com/machine-learning/crash-course/feature-crosses/encoding-nonlinearity.
[23] Freshness and diversity in recommendation systems. https://developers.google.com/machine-learning/recommendation/dnn/re-ranking.
[24] Privacy and security in ML. https://www.microsoft.com/en-us/research/blog/privacy-preserving-machine-learning-maintaining-confidentiality-and-preserving-trust/.
[25] Two-sides marketplace unique challenges. https://www.uber.com/blog/uber-eats-recommending-marketplace/.
[26] Data leakage. https://machinelearningmastery.com/data-leakage-machine-learning/.
[27] Online training frequency. https://huyenchip.com/2022/01/02/real-time-machine-learning-challenges-and-solutions.html#towards-continual-learning.

Chapter 8: Ad Click Prediction on Social Platforms

[1] Addressing delayed feedback. https://arxiv.org/pdf/1907.06558.pdf.
[2] AdTech basics. https://advertising.amazon.com/library/guides/what-is-adtech.
[3] SimCLR paper. https://arxiv.org/pdf/2002.05709.pdf.
[4] Feature crossing. https://developers.google.com/machine-learning/crash-course/feature-crosses/video-lecture.
[5] Feature extraction with GBDT. https://towardsdatascience.com/feature-generation-with-gradient-boosted-decision-trees-21d4946d6ab5.
[6] DCN paper. https://arxiv.org/pdf/1708.05123.pdf.
[7] DCN V2 paper. https://arxiv.org/pdf/2008.13535.pdf.
[8] Microsoft’s deep crossing network paper. https://www.kdd.org/kdd2016/papers/files/adf0975-shanA.pdf.
[9] Factorization Machines. https://www.jefkine.com/recsys/2017/03/27/factorization-machines/.
[10] Deep Factorization Machines. https://d2l.ai/chapter_recommender-systems/deepfm.html.
[11] Kaggle’s winning solution in ad click prediction. https://www.youtube.com/watch?v=4Go5crRVyuU.
[12] Data leakage in ML systems. https://machinelearningmastery.com/data-leakage-machine-learning/.
[13] Time-based dataset splitting. https://www.linkedin.com/pulse/time-based-splitting-determining-train-test-data-come-manraj-chalokia/?trk=public_profile_article_view.
[14] Model calibration. https://machinelearningmastery.com/calibrated-classification-model-in-scikit-learn/.
[15] Field-aware Factorization Machines. https://www.csie.ntu.edu.tw/~cjlin/papers/ffm.pdf.
[16] Catastrophic forgetting problem in continual learning. https://www.cs.uic.edu/~liub/lifelong-learning/continual-learning.pdf.

Chapter 9: Similar Listings on Vacation Rental Platforms

[1] Instagram’s Explore recommender system. https://ai.facebook.com/blog/powered-by-ai-instagrams-explore-recommender-system.
[2] Listing embeddings in search ranking. https://medium.com/airbnb-engineering/listing-embeddings-for-similar-listing-recommendations-and-real-time-personalization-in-search-601172f7603e.
[3] Word2vec. https://en.wikipedia.org/wiki/Word2vec.
[4] Negative sampling technique. https://www.baeldung.com/cs/nlps-word2vec-negative-sampling.
[5] Positional bias. https://eugeneyan.com/writing/position-bias/.
[6] Random walk. https://en.wikipedia.org/wiki/Random_walk.
[7] Random walk with restarts. https://www.youtube.com/watch?v=HbzQzUaJ_9I.
[8] Seasonality in recommendation systems. https://www.computer.org/csdl/proceedings-article/big-data/2019/09005954/1hJsfgT0qL6.

Chapter 10: Personalized News Feed

[1] News Feed ranking in Facebook. https://engineering.fb.com/2021/01/26/ml-applications/news-feed-ranking/.
[2] Twitter’s news feed system. https://blog.twitter.com/engineering/en_us/topics/insights/2017/using-deep-learning-at-scale-in-twitters-timelines.
[3] LinkedIn’s News Feed system LinkedIn. https://engineering.linkedin.com/blog/2020/understanding-feed-dwell-time.
[4] BERT paper. https://arxiv.org/pdf/1810.04805.pdf.
[5] ResNet model. https://arxiv.org/pdf/1512.03385.pdf.
[6] CLIP model. https://openai.com/blog/clip/.
[7] Viterbi algorithm. https://en.wikipedia.org/wiki/Viterbi_algorithm.
[8] TF-IDF. https://en.wikipedia.org/wiki/Tf%E2%80%93idf.
[9] Word2vec. https://en.wikipedia.org/wiki/Word2vec.
[10] Serving a billion personalized news feed. https://www.youtube.com/watch?v=Xpx5RYNTQvg.
[11] Mean absolute error loss. https://en.wikipedia.org/wiki/Mean_absolute_error.
[12] Means squared error loss. https://en.wikipedia.org/wiki/Mean_squared_error.
[13] Huber loss. https://en.wikipedia.org/wiki/Huber_loss.
[14] A news feed system design. https://liuzhenglaichn.gitbook.io/system-design/news-feed/design-a-news-feed-system.
[15] Predict viral tweets. https://towardsdatascience.com/using-data-science-to-predict-viral-tweets-615b0acc2e1e.
[16] Cold start problem in recommendation systems. https://en.wikipedia.org/wiki/Cold_start_(recommender_systems).
[17] Positional bias. https://eugeneyan.com/writing/position-bias/.
[18] Determine retraining frequency. https://huyenchip.com/2022/01/02/real-time-machine-learning-challenges-and-solutions.html#towards-continual-learning.

Chapter 11: People You May Know

[1] Clustering in ML. https://developers.google.com/machine-learning/clustering/overview.
[2] PYMK on Facebook. https://youtu.be/Xpx5RYNTQvg?t=1823.
[3] Graph convolutional neural networks. http://tkipf.github.io/graph-convolutional-networks/.
[4] GraphSage paper. https://cs.stanford.edu/people/jure/pubs/graphsage-nips17.pdf.
[5] Graph attention networks. https://arxiv.org/pdf/1710.10903.pdf.
[6] Graph isomorphism network. https://arxiv.org/pdf/1810.00826.pdf.
[7] Graph neural networks. https://distill.pub/2021/gnn-intro/.
[8] Personalized random walk. https://www.youtube.com/watch?v=HbzQzUaJ_9I.
[9] LinkedIn’s PYMK system. https://engineering.linkedin.com/blog/2021/optimizing-pymk-for-equity-in-network-creation.
[10] Addressing delayed feedback. https://arxiv.org/pdf/1907.06558.pdf

iwhsin/ml-bytebytego

Reference Materials for System Design Interview

Chapter 1: Introduction and Overview

Chapter 2: Visual Search System

Chapter 3: Google Street View Blurring System

Chapter 4: YouTube Video Search

Chapter 5: Harmful Content Detection

Chapter 6: Video Recommendation System

Chapter 7: Event Recommendation System

Chapter 8: Ad Click Prediction on Social Platforms

Chapter 9: Similar Listings on Vacation Rental Platforms

Chapter 10: Personalized News Feed

Chapter 11: People You May Know