Pinned Repositories
lectures
Material for gpu-mode lectures
neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
awesome-profiling
Awesome utilities for performance profiling
C-compiler-optimizations
Description of commonly done compiler optimizations in C
ml-design-patterns
Software Architecture for ML engineers
multiple_dispatch
Why multiple dispatch lets you write composable code
ao
PyTorch native quantization and sparsity for training and inference
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
serve
Serve, optimize and scale PyTorch models in production
msaroufim's Repositories
msaroufim/ML-devops
Helper scripts I use to run many experiments in the morning to check at night
msaroufim/sixfigurecareer
Pip install yourself to a six figure career!
msaroufim/intermediate-python
An intro for people that want to ship not just read code
msaroufim/openaitritontutorial
msaroufim/pytorch-from-scratch
PyTorch models implemented from scratch
msaroufim/android-demo-app
PyTorch android examples of usage in applications
msaroufim/load_model
msaroufim/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
msaroufim/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
msaroufim/benchmark
msaroufim/builder
Continuous builder and binary build scripts for pytorch
msaroufim/ci-test
msaroufim/cloud
The TensorFlow Cloud repository provides APIs that will allow to easily go from debugging and training your Keras and TensorFlow code in a local environment to distributed training in the cloud.
msaroufim/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
msaroufim/flyte
Kubernetes-native workflow automation platform for complex, mission-critical data and ML processes at scale. It has been battle-tested at Lyft, Spotify, Freenome, and others and is truly open-source.
msaroufim/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
msaroufim/intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
msaroufim/maml-jax
Implementation of Model-Agnostic Meta-Learning (MAML) in Jax
msaroufim/notebooks
Notebooks using the Hugging Face libraries 🤗
msaroufim/pyscript
msaroufim/rfcs
PyTorch RFCs (experimental)
msaroufim/rich
Rich is a Python library for rich text and beautiful formatting in the terminal.
msaroufim/RogueChess
A roguelike chess game
msaroufim/RoME.jl
Robot Motion Estimate: Tools, Variables, and Factors commonly used for SLAM robotics.
msaroufim/spektral
Graph Neural Networks with Keras and Tensorflow 2.
msaroufim/testsadasdsad
msaroufim/Transformers-Recipe
🧠 A quick recipe to learn all about Transformers
msaroufim/treex
A Pytree Module system for Deep Learning in JAX
msaroufim/unet-pytorch-ipu
msaroufim/website
Kubeflow's public website