muellerzr's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
stas00/ml-engineering
Machine Learning Engineering Open Book
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
NVIDIA/ChatRTX
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
pytorch/torchtitan
A native PyTorch Library for large model training
microsoft/mup
maximal update parametrization (µP)
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
anibali/docker-pytorch
A Docker image for PyTorch
tinygrad/open-gpu-kernel-modules
NVIDIA Linux open GPU with P2P support
stas00/the-art-of-debugging
The Art of Debugging
pytorch/PiPPy
Pipeline Parallelism for PyTorch
pacman100/LLM-Workshop
LLM Workshop by Sourab Mangrulkar
bigcode-project/starcoder2-self-align
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation
muellerzr/minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
lucidrains/recurrent-interface-network-pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in PyTorch
SergioMEV/slurm-for-dummies
A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04 LTS using Slurm and Munge. Created by the Quant Club @ UIowa.
LukasHedegaard/pytorch-benchmark
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated GPU memory, and energy consumption
xrsrke/pipegoose
Large-scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
fchollet/namex
Clean up the public namespace of your package!
Youhe-Jiang/IJCAI2023-OptimalShardedDataParallel
[IJCAI2023] An automated parallel training system that combines the advantages of both data and model parallelism. If interested, please visit/star/fork https://github.com/Youhe-Jiang/OptimalShardedDataParallel
muellerzr/nbquarto
Small Python library solely for quick Quarto extensions
gnovack/distributed-training-and-deepspeed
muellerzr/RAG-Experiments
My learnings (publicly) on RAG systems
TJ-Solergibert/transformers-in-supercomputers
Transformers training on a supercomputer with the 🤗 Stack and Slurm
BenjaminBossan/pytest-guide
Pytest guide for unittest users
muellerzr/llama-3-8b-self-align
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation applied to llama 3 8b
muellerzr/swe-study-group
Code for the SWE study group
lessw2020/hyper_efficient_optimizers
Development of hyper efficient optimizers that can match/exceed AdamW, while using reduced memory