konstantinator

konstantinator's Stars

huggingface/deep-rl-class
This repo contains the Hugging Face Deep Reinforcement Learning Course.
Language: MDX 4.2k 80 327655
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
Language: Jupyter Notebook 3.9k 92 30430
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language: Python 1.2k 17 28143
srush/LLM-Training-Puzzles
What would you do with 1000 H100s...
Language: Jupyter Notebook 1k 12 465
fbeilstein/machine_learning
Language: Jupyter Notebook 171 12 144