/rl_reacher

Train double-jointed arms to reach target locations using Proximal Policy Optimization (PPO) in Pytorch

Primary LanguageJupyter NotebookMIT LicenseMIT

No issues in this repository yet.