zhonghai1995

zhonghai1995's Stars

gxywy/rl-plotter
:sparkles: A plotter for reinforcement learning (RL)
Language:Python20630
google-deepmind/alphastar
Language:Python39750
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
3.9k213
histmeisah/Large-Language-Models-play-StarCraftII
TextStarCraft2,a pure language env which support llms play starcraft2
Language:Python19212
Michael-Beukman/RobocupGym
Reinforcement Learning inside a 3D soccer simulation
Language:Python19
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python1.1k124
cor3bit/bertsekas-marl
PyTorch Implementation of the Sequential Multiagent Rollout algorithm
Language:Python102
corl-team/xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Language:Python18915
proroklab/VectorizedMultiAgentSimulator
VMAS is a vectorized differentiable simulator designed for efficient Multi-Agent Reinforcement Learning benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface.
Language:Python31768
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Language:Python1.2k269
Haichao-Zhang/PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
Language:Python435
PKU-MARL/DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
Language:Python62774
shariqiqbal2810/maddpg-pytorch
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
Language:Python555127
twni2016/pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
Language:Python29641
ikostrikov/rlpd
Language:Python20424
vitchyr/viskit
rllab's viskit with some added features
Language:Python7335
google-deepmind/distrax
Language:Python53032
my-yy/s2v_rc
Speech2Vec Reality Check
Language:Python753
RLE-Foundation/rllte
Long-Term Evolution Project of Reinforcement Learning
Language:Python46584
shadps4-emu/shadPS4
PS4 emulator for Windows,Linux,MacOS
Language:C++9.7k543
google-deepmind/optax
Optax is a gradient processing and optimization library for JAX.
Language:Python1.6k180
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
Language:Python6k631
minitorch/minitorch
The full minitorch student suite.
Language:Python1.9k363
facebookresearch/Pearl
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
Language:Jupyter Notebook2.6k156
karpathy/LLM101n
LLM101n: Let's build a Storyteller
29k1.6k
jayeshs999/sapg
Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)
Language:Jupyter Notebook372
Emerge-Lab/gpudrive
GPU-acceleration of Nocturne via Madrona
Language:Jupyter Notebook19617
mantle2048/rlplot
rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").
Language:Python263
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Language:Jupyter Notebook75146
denisyarats/drq
DrQ: Data regularized Q
Language:Jupyter Notebook40252