ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
PythonMIT
Stargazers
- adrepd
- allenye0119
- Cadene@huggingface
- cswhjiangShenzhen, China
- dendisuhubdy@bitwyre
- dexter1691Georgia Tech
- eagle705SKTelecom
- ethancaballero@mila-iqia
- flutist
- fly51flyPRIS
- gmlee
- gujiuxiangAdobe Research
- gurudave
- iidsample
- ink-padIBM Research
- jfsantos@NVIDIA
- Kaixhin@arayabrain
- kelvinxuUC Berkeley
- keonSan Francisco
- lantigaLightning AI
- mabirck@ufpeldatalab
- nakosungNAVER, CLOVA AI
- nanxintinHuawei Noah's Ark Lab
- pemami4911NREL
- PerathamUniversity of Maryland
- pranz24
- rhythm92Japan
- ruotianluoWaymo
- SeanNaren@NVIDIA
- seba-1511Google DeepMind
- tegg89EpiSys Science
- uralik
- wangxiao5791509Anhui University (安徽大学)
- ww880412
- xiaowei-huBeijing, China
- yobibyteOxford