quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Jupyter Notebook, Apache-2.0
Stargazers
- Abhipanda4
- awarebayes
- chjq201410695
- Confusezius, University of Tuebingen
- fjibj, Nanjing
- fly51fly, PRIS
- hyc6668378, Tokyo
- JDvorak, Kunai
- kashif, Berlin, Germany
- kawaiibilli
- keiohta, Tokyo, Japan
- liuanji
- liuzhenqi77, @netneurolab
- maosengshulei
- MaraniMatias, @Qutap @deeplearningrosario
- Maxpridy, @netmarble
- mh-ha
- mingbocui, Zürich
- normark, UoB
- nttrungmt, Winona, MN 55987, USA
- RexKing6, Hangzhou
- Sohojoe, Microsoft
- SSinyu, NEXON
- StepNeverStop, Nanjing University
- Supertrampchaochao
- vvanirudh, Pittsburgh
- wangjail
- wangxiao5791509, Anhui University (安徽大学)
- wangy12, Canada
- wkentaro, @mujin
- xiachenfeng, Baidu.Inc.
- xiaoiker, Stanford University
- yeates, University of Rochester
- yueboyan
- Zhanyu-Wang, Purdue University
- zhiyue, Guangzhou