A 2-d 2-DOF robot arm toy model to demonstrate Deep Deterministic Policy Gradient while training after trained