/trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

Primary LanguageJupyter NotebookMIT LicenseMIT

Stargazers

No one’s star this repository yet.