Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
PythonMIT
Stargazers
- achaiah
- akansal1
- aleSugliaHeriot-Watt University
- AndersonPeng@MISLab
- beckybaiDuke University
- claudiogreco
- cn3c3pshanghai
- crystalbai
- dexter1691Georgia Tech
- eyadsibai
- G-WangGoogle
- IceClearNanyang Technological University (NTU)
- jithinodattuinferencemachines
- JunhongXu
- Kartikaggarwal98Washington DC
- kevin-zhanglf
- kinopiiiii
- lionelblondeSwitzerland
- magnusja@luminovo
- morikaz0429
- Pengcheng-WangNorth Carolina State University
- pranz24
- rhythm92Japan
- rongzhou
- sanjeevanahilan
- SeekPoint
- shanjgitTaiwan
- shinshinerSJTU
- shubhampachori12110095Somewhere in India
- smrjansTalentica
- speedcell4NICT
- walkacrossShenzhen, China
- wangbx66
- wolegechuStepFUN
- yikaiwTsinghua University
- yinxiaochuan