konstantinator's Stars
huggingface/deep-rl-class
This repo contains the Hugging Face Deep Reinforcement Learning Course.
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
srush/LLM-Training-Puzzles
What would you do with 1000 H100s...
fbeilstein/machine_learning