ChuaCheowHuan/reinforcement_learning
My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow.
Jupyter Notebook, MIT license
Stargazers
- akansal1
- ankurhcu
- BeeGass (@USArmyResearchLab)
- bryanyuan1 (LaunchIt)
- Carryma11 (Shanghai, China)
- cvanoort (University of Vermont)
- DavidDB33
- DavideDelVecchio
- EmmaMuhleman1 (Ascend Investment Management LLC)
- ES-labrepo
- Eyunfang
- FrankCCCCC (Hsinchu)
- franz101
- gothefrid
- horoiwa
- huaisifan
- jagatiyakeval
- kimbring2 (kimbring2)
- leigh-johnson (San Francisco)
- liuqingxi23
- lollipopnougat
- LRAbbade (Morgan Stanley)
- luchenlei7
- marcusau
- markub3327 (University of Ss. Cyril and Methodius in Trnava)
- ohoneyd
- rouzbe
- SPAMbanana
- SpencerRaw
- sunhaoyuan3310 (University of Technology and Science of China)
- Super-Alpha (BeiJing)
- vassil-atn
- Xunmenggod
- XUZiteng2020
- YaoHanLin66