/awesomeMLSys

An ML Systems Onboarding list

ML Systems Onboarding Reading List

This is a reading list of papers/videos/repos I've personally found useful as I was ramping up on ML Systems and that I wish more people would just sit and study carefully during their work hours. If you're looking for more recommendations, go through the citations of the below papers and enjoy!

Attention Mechanism

Performance Optimizations

Quantization

Long context length

Sparsity

  • Venom: Vectorized N:M Format for sparse tensor cores when hardware only supports 2:4
  • Megablocks: Efficient Sparse training with mixture of experts
  • ReLu Strikes Back: Really enjoyed this paper as an example of doing model surgery for more efficient inference

Distributed