Proximal Policy Optimization

Classic control

Acrobot-v1 CartPole-v1 MountainCarContinuous-v0 Pendulum-v0

Box2D Roboschool

LunarLanderContinuous-v2 BipedalWalker-v3 RoboschoolAnt-v1 RoboschoolHalfCheetah-v1

This repository contains the source code pytorch realization of PPO for solving openai gym enviroments.

Proximal Policy Optimization is the one of state of the art reinforcement learning algorithm, its main feature is the control of policy changes, we don't want to deviate too much from the old policy when recalculating weights.

All solved environments with hyperparameters are available here opeanai_ppo.ipynb
The code is based on this repository - https://github.com/MrSyee/pg-is-all-you-need.

mandrakedrink/PPO-pytorch

Proximal Policy Optimization

This repository contains the source code pytorch realization of PPO for solving openai gym enviroments.