/drlnd-p2-Continuous_Control

This is the 2nd project in Udacity DRLND, which is practice for training an agent that controls a robotic arm in Unity's Reacher environment using the Deep Deterministic Policy Gradients (DDPG) algorithm.

Primary LanguageJupyter Notebook

Watchers