guillaumeboniface/reacher

Reinforcement learning agents for continuous action spaces, including PPO and DDPG, implemented for the Reacher environment

Jupyter Notebook
