guillaumeboniface/reacher
Reinforcement learning agents for continuous action spaces, including PPO and DDPG, implemented for the Reacher environment
Jupyter Notebook
Reinforcement learning agents for continuous action spaces, including PPO and DDPG, implemented for the Reacher environment
Jupyter Notebook