zcchenvy's Stars
d2l-ai/d2l-zh
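Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.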
google/or-tools
Google's Operations Research tools
datawhalechina/easy-rl
A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
kangjianwei/Data-Structure
Data Structures (Yan Weimin, Wu Weimin): textbook source code and exercise solutions
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
LeechanX/Data-Structures-and-Algorithms-in-C
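Pure C implementations of all the basic data structures and algorithms, including various sorting algorithms, linked lists, stacks, queues, various trees and their applications, graph algorithms, string-matching algorithms, backtracking, and union-find. A humble effort.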
LucasAlegre/sumo-rl
Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.
TianhongDai/reinforcement-learning-algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
tencent-ailab/hok_env
Honor of Kings AI Open Environment of Tencent
MishaLaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
metaopt/torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
marlbenchmark/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
Pi-Star-Lab/RESCO
Reinforcement Learning Benchmarks for Traffic Signal Control (RESCO)
poetries/Data-Structure
Data structure study notes
lich14/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
gwthomas/force
A library for reinforcement learning research
skumar9876/FCRL
Implementation of "Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning" (https://arxiv.org/pdf/1712.08266.pdf)
PKU-RL/CORRO
CORRO code
wingsweihua/gym_cityflow
Adds CityFlow to Gym
seong-hun/fym
Flight simulator for various purposes
xinghua-qu/Importance-Prioritized-Policy-Distillation
Source code for paper: Importance Adaptive Policy Distillation
ZJU-DAI/DAI-Documents
Documents for DAI group
yyds-xtt/AdaRL-code
Implementation code and datasets used in the ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning.
zcchenvy/d2l-zh
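Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at 300 universities in 55 countries.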