rl
There are 1137 repositories under the rl topic.
LlamaFamily/Llama-Chinese
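Llama Chinese community: a real-time roundup of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models. Fully open source and available for commercial use.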
google/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
OpenPipe/ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
junxiaosong/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
pytorch/ELF
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
inclusionAI/AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
IntelLabs/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state-of-the-art Reinforcement Learning algorithms.
TsinghuaC3I/Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
PRIME-RL/PRIME
Scalable RL solution for advanced reasoning of language models
MaximeVandegar/Papers-in-100-Lines-of-Code
Implementation of papers in 100 lines of code.
pathak22/noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
zzli2022/Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
FareedKhan-dev/all-rl-algorithms
Implementation of all RL algorithms in a simpler way
araffin/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
sail-sg/understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
JudgmentLabs/judgeval
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.
PRIME-RL/SimpleVLA-RL
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
MushroomRL/mushroom-rl
Python library for Reinforcement Learning.
PRIME-RL/TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
erlerobot/gym-gazebo
Refer to https://github.com/AcutronicRobotics/gym-gazebo2 for the new version
google-research/seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
derisk-ai/OpenDerisk
AI-Native Risk Intelligence Systems. OpenDeRisk: an intelligent risk manager for your application systems, providing 24/7 comprehensive, in-depth protection.
ashishpatel26/Real-time-ML-Project
A curated list of applied machine learning and data science notebooks and libraries across different industries.
zeroth-robotics/zeroth-bot
3D-printed open-source humanoid robot platform for sim-to-real and RL
araffin/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
benedekrozemberczki/awesome-monte-carlo-tree-search-papers
A curated list of Monte Carlo tree search papers with implementations.
utilForever/RosettaStone
Hearthstone simulator using C++ with some reinforcement learning
Stable-Baselines-Team/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
yrlu/irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow: Deep MaxEnt, MaxEnt, LPIRL.
peteanderson80/Matterport3DSimulator
AI Research Platform for Reinforcement Learning from Real Panoramic Images.
neptune-ai/neptune-client
📘 The experiment tracker for foundation model training