/rl_bipedal

Solving OpenAI gym's Bipedal environment using Proximal Policy Optimization (PPO)

Primary LanguageJupyter NotebookMIT LicenseMIT

Stargazers