jjkke88/RL_toolbox
Reinforcement learning toolbox containing the TRPO and A3C algorithms for continuous action spaces
Python · MIT
Stargazers
- a75c6
- AirSmithX
- bjnortier (Cape Town)
- brightgems (Shanghai)
- Chinese-Xu (Augmentum)
- Dacrol (Consid)
- digshock
- esafak (Archipelago AI)
- falconzyx (University of Alberta)
- fly51fly (PRIS)
- forrestbing (Alibaba Inc)
- g-chang
- gurusura (Sura Systems Private Limited)
- ioriiod0
- jjkke88 (Tsinghua University)
- junkyul
- liangkai (candytalk)
- luzilon1
- MathematicalModels
- muupan (@pfnet)
- nanxintin (Huawei Noah's Ark Lab)
- pakchoi (China)
- ruotianluo (Waymo)
- ryanliao360
- SeekPoint
- shareeff
- skeeet (Bay Area)
- ucaiado (São Paulo, Brazil)
- wangxiao5791509 (Anhui University, 安徽大学)
- ww880412
- yhyu13 (Shanghai, China)
- yiminglin-ai (@Realeyes)
- yimingpeng (WetaFX)
- zhiyue (Guangzhou)
- zmoon111 (horizon-robotics)
- zsdonghao (Peking University)