Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Python · MIT
Watchers
- artu001
- asakuras
- cjy1992 (Tsinghua University)
- codeislife99 (Waymo)
- erschmidt (Stuttgart, Germany)
- hohoCode (University of Maryland College Park)
- hushidong
- jhcloos
- justicelee
- kaelgabriel (Campinas, São Paulo, Brazil)
- Kelvinson (Somewhere)
- Khrylx (NVIDIA Research)
- lliai
- minesh1291 (@Kaggle)
- morikaz0429
- NewEnglandML
- nomoreid
- paper2code-bot (@paper2code)
- pprivulet (Avaloon)
- rwill128 (Atlanta, GA)
- suncj
- tatsuya-ishihara
- wicky08
- wuxianliang (Beijing No.2 Experimental Primary School)
- wx-b (RIOS)
- YuanXue1993 (Ohio State University)
- zlf0625