sanmuyang's Stars
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions to the exercises in Reinforcement Learning: An Introduction (2nd Edition)
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
oxwhirl/smac
SMAC: The StarCraft Multi-Agent Challenge
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
NtrQQ/download
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
sanmuyang/multi-agent-PPO-on-SMAC
Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.
Facebear-ljx/DOGE
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR 2023)
ryanxhr/DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"