Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Includes TRPO with a fast Fisher vector product.
Python · MIT
Stargazers
- yilifzf (China)
- rohitgajawada (Atlanta, GA)
- Sushil-Thapa
- kenfehling (Honolulu)
- zdx3578 (https://twitter.com/createamindcn)
- forrestbing (Hangzhou, Zhejiang)
- wuxianliang
- MSC19950601
- zhangpanrobot
- ThinkingHeart
- ZhuoranYang
- fengyanghe
- xsr-thu
- jdc08161063
- codealphago
- wangxiao5791509 (Hefei, Anhui Province, China)
- pollywuhao
- memoiry (Beijing)
- lgcming
- lan2720 (China)
- Isminoula (Blacksburg, VA)
- JIElite (Taipei, Taiwan)
- eric-xw
- MathematicalModels
- zhangbonian
- chmxu (Shanghai)
- kwnsiy (Japan)
- zchen0211 (Menlo Park, CA)
- woodfrog
- aidiary (Japan)
- rooa
- scissorsf
- himkt (Tokyo, Japan)
- Skorkmaz88 (Germany)
- rapirent (Taipei)
- zackgow (internet)