zcchenvy

RL 和 multiagent RL领域的一个白痴

Dalian Maritime UniversityDalian

zcchenvy's Stars

d2l-ai/d2l-zh
《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Language:Python61.4k 1.1k 010.9k
google/or-tools
Google's Operations Research tools:
Language:C++11k 304 2.9k2.1k
datawhalechina/easy-rl
强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook9.1k 78 1431.8k
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Language:Jupyter Notebook7.8k 103 1.5k780
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python5.3k 35 182604
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language:Python3.6k 51 252833
kangjianwei/Data-Structure
《数据结构》-严蔚敏.吴伟民-教材源码与习题解析
Language:C3.5k 68 31987
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
Language:Jupyter Notebook2.3k 15 82515
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Language:Python1.3k 7 90289
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Language:Python1.2k 29 211269
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
904 45 187
LeechanX/Data-Structures-and-Algorithms-in-C
所有基础数据结构和算法的纯C语言实现，如各自排序、链表、栈、队列、各种树以及应用、图算法、字符串匹配算法、回溯、并查集等，献丑了
Language:C836 49 0331
LucasAlegre/sumo-rl
Reinforcement Learning environments for Traffic Signal Control with SUMO. Compatible with Gymnasium, PettingZoo, and popular RL libraries.
Language:Python698 11 172191
TianhongDai/reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
Language:Python662 15 10109
tencent-ailab/hok_env
Honor of Kings AI Open Environment of Tencent
Language:Python616 16 6372
MishaLaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
Language:Python569 11 2687
metaopt/torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
Language:Python528 12 3735
marlbenchmark/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
Language:Python387 3 1267
Pi-Star-Lab/RESCO
Reinforcement Learning Benchmarks for Traffic Signal Control (RESCO)
Language:Python114 4 2136
poetries/Data-Structure
数据结构学习笔记
Language:C99 8 166
lich14/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
Language:Python83 1 1120
gwthomas/force
A library for reinforcement learning research
Language:Python54 10 18
skumar9876/FCRL
Implementation of "Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning" (https://arxiv.org/pdf/1712.08266.pdf)
Language:Python36 5 111
PKU-RL/CORRO
CORRO code
Language:Python33 0 46
wingsweihua/gym_cityflow
Adds CityFlow to Gym
Language:Python28 1 015
seong-hun/fym
Flight simulator for various purpose
Language:Python16 6 959
xinghua-qu/Importance-Prioritized-Policy-Distillation
Source code for paper: Importance Adaptive Policy Distillation
Language:Python7 1 10
ZJU-DAI/DAI-Documents
Documents for DAI group
Language:Jupyter Notebook3 9 02
yyds-xtt/AdaRL-code
Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning.
Language:Python1 0 0
zcchenvy/d2l-zh
《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被55个国家的300所大学用于教学。
Language:Python1 0 01

zcchenvy

zcchenvy's Stars

d2l-ai/d2l-zh

google/or-tools

datawhalechina/easy-rl

google-deepmind/mujoco

vwxyzjn/cleanrl

AI4Finance-Foundation/ElegantRL

kangjianwei/Data-Structure

boyu-ai/Hands-on-RL

marlbenchmark/on-policy

Farama-Foundation/Metaworld

hanjuku-kaso/awesome-offline-rl

LeechanX/Data-Structures-and-Algorithms-in-C

LucasAlegre/sumo-rl

TianhongDai/reinforcement-learning-algorithms

tencent-ailab/hok_env

MishaLaskin/curl

metaopt/torchopt

marlbenchmark/off-policy

Pi-Star-Lab/RESCO

poetries/Data-Structure

lich14/CDS

gwthomas/force

skumar9876/FCRL

PKU-RL/CORRO

wingsweihua/gym_cityflow

seong-hun/fym

xinghua-qu/Importance-Prioritized-Policy-Distillation

ZJU-DAI/DAI-Documents

yyds-xtt/AdaRL-code

zcchenvy/d2l-zh