PPO-PyTorch

Implementation of Proximal Policy Optimization (PPO)


This is a PyTorch implementation of Proximal Policy Optimization as described in this paper.

The implementation in this repo served as a reference for this one.
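For readers new to PPO, here is a minimal sketch of the clipped surrogate objective from the paper. It is not taken from this repo's code: it uses plain Python instead of PyTorch so it runs without dependencies, and the function names (`clipped_surrogate`, `actor_loss`) and the default `clip_eps=0.2` are assumptions chosen for illustration.

```python
import math


def clipped_surrogate(new_logp, old_logp, advantage, clip_eps=0.2):
    """Per-sample PPO clipped surrogate objective (a quantity to maximize).

    new_logp / old_logp are log-probabilities of the taken action under the
    current and the data-collecting policy. All names here are hypothetical.
    """
    # Probability ratio r = pi_new(a|s) / pi_old(a|s), computed from log-probs.
    ratio = math.exp(new_logp - old_logp)
    # Restrict the ratio to the interval [1 - eps, 1 + eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the more pessimistic of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)


def actor_loss(new_logps, old_logps, advantages, clip_eps=0.2):
    """Negative batch mean of the objective, suitable for minimization."""
    terms = [
        clipped_surrogate(n, o, a, clip_eps)
        for n, o, a in zip(new_logps, old_logps, advantages)
    ]
    return -sum(terms) / len(terms)
```

With a positive advantage, the objective stops growing once the ratio exceeds `1 + clip_eps`, which removes the incentive to move the policy far from the one that collected the data. A PyTorch version would typically replace the scalar operations with tensor operations such as `torch.exp`, `torch.clamp`, and `torch.min`.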

To run a demo, clone the repo and use the command: python simulate.py

The training plots are shown below:

reward plot

actor loss plot

critic loss plot