Create reward function for RL

Question

Closed this issue 9 years ago · 0 comments

Potentially:

if throw:
    return - distance - max(distance)
else:
    return distance

Do we need a differentiable one instead?