keon/policy-gradient
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
PythonMIT
Stargazers
- abcp4
- abhigenie92CS@Columbia University
- aijunbaiUC Berkeley
- alex-orlovskyiChernivtsi, Ukraine
- b03902043
- barisserVoleon
- BillySilverNational Taiwan University
- BleyddynSouthern California
- christfan868
- edwardyuNew York, NY
- elvisyjlinMicrosoft
- farizikhwantriTokyo Institute of Technology
- franciskim@80bots
- frankibem
- GuanxiongLiuNew Jersey Institute of Technology
- hedes1992
- hoagy-davis-digges
- hoangcuong2011
- hsmyy
- jihobakeigencapital
- jounimakelaOulu, Finland
- jungrok5@NCSOFT
- keonSan Francisco
- KimEJSeoul, Korea
- likejazzDnotitia
- mehdidcJuelich Supercomputing Center (JSC), Forschungszentrum Jülich GmbH, LAION
- mehulpatel21@jpmorganchase
- peter0749Taiwan
- qqtop
- rhythm92Japan
- seungjuleeNomad
- SunnyLily
- tensortalkYou're on TensorTalk.com!
- toshima
- ucaiadoSão Paulo, Brazil
- vickyliinGliaCloud @livingbio