1jsingh/rl_reacher
Train double-jointed arms to reach target locations using Proximal Policy Optimization (PPO) in Pytorch
Jupyter NotebookMIT
No issues in this repository yet.
Train double-jointed arms to reach target locations using Proximal Policy Optimization (PPO) in Pytorch
Jupyter NotebookMIT
No issues in this repository yet.