/dip

Proximal Policy Optimisation (PPO) PyTorch implementation for the inverted double pendulum problem.

Primary LanguageJupyter Notebook

No issues in this repository yet.