/policy-gradient

vanilla policy gradient on pong and maybe more

Primary LanguageJupyter Notebook

Pong using Policy Gradient Method

My implementation of pong from pixels...thanks Kaparthy!