/policy-gradient

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

Primary LanguagePythonMIT LicenseMIT

Watchers