orrivlin/MountainCar_DQN_RND
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
Python
Stargazers
- 450565034
- 514123661
- Ac314159
- ackdavZürich
- annabaliMoscow
- ArcherShirou
- bok7luckKorea
- chaobiubiu
- chl1926792527
- dvu4Chicago
- EgbertB
- ES-labrepo
- firstlast199
- Fraser-GreenleeStealth
- HarryXuancy
- HeZez
- JMian
- jzzzfShanghai Jiaotong University
- kiankyars
- konichuvak
- liigoQiFudan University
- madlsj
- Maicon-MoreiraBlockful
- markub3327University of Ss. Cyril and Methodius in Trnava
- MegaYEyePurdue Univ
- Mingcong-Cao
- misterhuochina
- qingzhu0214
- scottvcaputo
- self-supervisorUniversity College London
- sjYoondeltarSeoul
- speedcell4NICT
- stjordanisGreece
- yjh05025minlab
- youri98
- zoeshao0425