Vanilla policy gradient on Pong, and maybe more
Primary language: Jupyter Notebook
My implementation of Pong from pixels. Thanks, Karpathy!
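
For readers new to vanilla policy gradient, here is a minimal sketch of the return computation it relies on. It is an illustration, not the code from the notebook: the names `discount_rewards` and `standardize`, the default `gamma=0.99`, and the choice to reset the running sum at every nonzero reward (treating each Pong point as a boundary) are all assumptions made for this example.

```python
def discount_rewards(rewards, gamma=0.99, reset_on_nonzero=True):
    """Return the discounted return for every time step of one episode.

    Hypothetical helper: `gamma` and `reset_on_nonzero` are illustrative
    defaults, not values taken from the notebook.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step accumulates the rewards that follow it.
    for t in reversed(range(len(rewards))):
        if reset_on_nonzero and rewards[t] != 0:
            # Assumption: a nonzero reward marks the end of a Pong point,
            # so credit should not leak across that boundary.
            running = 0.0
        running = running * gamma + rewards[t]
        returns[t] = running
    return returns


def standardize(values, eps=1e-8):
    """Shift to zero mean and scale to unit variance (a common variance-reduction trick)."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    std = var ** 0.5
    return [(v - mean) / (std + eps) for v in values]


if __name__ == "__main__":
    # Two points: one won (+1) and one lost (-1).
    episode_rewards = [0, 0, 1, 0, 0, -1]
    advantages = standardize(discount_rewards(episode_rewards))
    print(advantages)
```

In a policy-gradient update, each standardized return would weight the log-probability gradient of the action taken at that step, so actions that preceded a won point are made more likely and those that preceded a lost point less likely.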