karpathy/reinforcejs
Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients)
HTML
Watchers
- astrolopeKoverse
- belkhir-nacim@mirai
- BILLzzz
- bright-spark@bright-sparks
- desperado1992
- fredchenjialin
- gabriel-ozeas
- gabrielcc2University of Magdeburg, OvGU
- jhcloos
- josecohenca
- karpathyStanford
- kgryte@stdlib-js @quansight @data-apis
- kir0ul
- laglekiThe Logical Language Group
- laurencecaoShanghai
- linkerlin
- lt1946
- mantarohunimal-jp
- marcoippolitoMilano
- mcanthonyDΞFCONCΞPTS
- newsbubbles
- nosyndicateDropbox, George Mason University
- Pascal66@Priveyes
- Playinf
- rnuredini
- robertsdionneSan Francisco
- romanab
- roschler
- rwill128Atlanta, GA
- sheshuguang
- strategist922Microsoft
- suqi@eastlakeside
- SurgeonY
- wuxianliangBeijing No.2 Experimental Primary School
- zamberjo@aurestic
- zergskj