zhonghai1995's Stars
gxywy/rl-plotter
:sparkles: A plotter for reinforcement learning (RL)
google-deepmind/alphastar
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
Michael-Beukman/RobocupGym
Reinforcement Learning inside a 3D soccer simulation
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
cor3bit/bertsekas-marl
PyTorch Implementation of the Sequential Multiagent Rollout algorithm
corl-team/xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
proroklab/VectorizedMultiAgentSimulator
VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Haichao-Zhang/PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
PKU-MARL/DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
shariqiqbal2810/maddpg-pytorch
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
twni2016/pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
ikostrikov/rlpd
vitchyr/viskit
rllab's viskit with some added features
google-deepmind/distrax
my-yy/s2v_rc
Speech2Vec Reality Check
RLE-Foundation/rllte
Long-Term Evolution Project of Reinforcement Learning
shadps4-emu/shadPS4
PS4 emulator for Windows,Linux,MacOS
google-deepmind/optax
Optax is a gradient processing and optimization library for JAX.
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
minitorch/minitorch
The full minitorch student suite.
facebookresearch/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
jayeshs999/sapg
Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)
Emerge-Lab/gpudrive
GPU-acceleration of Nocturne via Madrona
mantle2048/rlplot
rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
denisyarats/drq
DrQ: Data regularized Q