marlbenchmark/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
PythonMIT
Stargazers
- 0xJchenmempool
- 51616https://vistec.ist/
- akashveluUniversity of California, Berkeley
- Axd12145
- balamir53
- CassidyJJH
- CHH3213
- Chloe4D
- flammingRaven
- funfwo
- happyemoji
- HELL-TO-HEAVEN
- hijkzzzNVIDIA
- huangshiyu13Zhipu AI
- iamwangyabinUK
- jemmiewwwPolyu
- jnbai517
- josecohenca
- kinalmehtaQualcomm
- kkkclearlove
- liuxiaoy16
- luo-li-ba-suo
- MDrW
- michaelperl
- running-mars
- Sobbbbbber
- tranhoangkhuongvn
- wangxinwi
- WentseChen
- WongziseoiZhejiang University
- Wormh0-le
- wwxFromTjuDRL/MAS
- xihuai18Shanghai Jiao Tong University
- yanchang-liang
- zcchenvyDalian Maritime University
- ZiyiLiubirdNanKai Universiy