rl

There are 1,137 repositories under the rl topic.

LlamaFamily/Llama-Chinese
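Llama Chinese community: a real-time roundup of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models; fully open source and commercially usable.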
Language: Python 14.7k 145 3401.3k
google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Language: Jupyter Notebook 10.8k 414 1751.4k
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language: Python 8.9k 89 7731.2k
OpenPipe/ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Language: Python 7.8k 43 101599
junxiaosong/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Language: Python 3.5k 99 1261k
pytorch/ELF
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
Language: C++ 3.4k 184 132571
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language: Python 3.2k 40 752418
inclusionAI/AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Language: Python 3k 25 126225
werner-duvaud/muzero-general
MuZero
Language: Python 2.7k 72 175663
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language: Python 2.6k 21 262574
IntelLabs/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state-of-the-art reinforcement learning algorithms.
Language: Python 2.4k 120 264463
TsinghuaC3I/Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
2k 12 14111
PRIME-RL/PRIME
Scalable RL solution for advanced reasoning of language models
Language: Python 1.8k 10 4299
MaximeVandegar/Papers-in-100-Lines-of-Code
Implementation of papers in 100 lines of code.
Language: Python 1.7k 26 12174
pathak22/noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Language: Python 1.5k 61 43306
zzli2022/Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
Language: Python 1.3k 12 1173
FareedKhan-dev/all-rl-algorithms
Implementation of all RL algorithms in a simpler way
Language: Jupyter Notebook 1.2k 18 1218
araffin/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Language: Python 1.2k 30 86209
sail-sg/understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
Language: Python 1.1k 12 2754
JudgmentLabs/judgeval
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
Language: Python 1k 6 6587
PRIME-RL/SimpleVLA-RL
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Language: Python 95746
MushroomRL/mushroom-rl
Python library for Reinforcement Learning.
Language: Python 913 20 59154
PRIME-RL/TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
Language: Python 888 13 4265
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Language: Jupyter Notebook 852 10 2149
erlerobot/gym-gazebo
Refer to https://github.com/AcutronicRobotics/gym-gazebo2 for the new version
Language: Python 846 49 0281
google-research/seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
Language: Python 834 41 75148
derisk-ai/OpenDerisk
AI-Native Risk Intelligence Systems. OpenDeRisk is an intelligent risk manager for your application systems, providing 24/7 comprehensive, in-depth protection.
Language: Python 74886
ashishpatel26/Real-time-ML-Project
A curated list of applied machine learning and data science notebooks and libraries across different industries.
732 10 0271
zeroth-robotics/zeroth-bot
3D-printed open-source humanoid robot platform for sim-to-real and RL
Language: Rust 724 22 40112
araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Language: Jupyter Notebook 712 10 13135
benedekrozemberczki/awesome-monte-carlo-tree-search-papers
A curated list of Monte Carlo tree search papers with implementations.
Language: Python 685 27 476
utilForever/RosettaStone
Hearthstone simulator using C++ with some reinforcement learning
Language: C++ 666 19 36780
Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Language: Python 658 17 164220
yrlu/irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
Language: Python 648 27 5148
peteanderson80/Matterport3DSimulator
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
Language: C++ 640 18 113136
neptune-ai/neptune-client
📘 The experiment tracker for foundation model training
Language: Python 620 20 25266
