bcaine/maddux

Create reward function for RL

Closed this issue · 0 comments

Potentially:

if throw:
    return -distance - max(distance)
else:
    return distance
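
A minimal runnable sketch of the pseudocode above, in Python. It follows the branches as written. The function name `reward`, the parameter `max_distance`, and the reading of `max(distance)` as a fixed upper bound on the distance are assumptions for illustration; none of them come from the maddux codebase.

```python
def reward(throw: bool, distance: float, max_distance: float) -> float:
    """Piecewise reward that mirrors the pseudocode literally.

    Assumptions (not stated in the issue):
      - `distance` is a non-negative scalar.
      - `max(distance)` means a known upper bound, passed as `max_distance`.
    """
    if throw:
        # Throw branch: negative distance, shifted down by the upper bound.
        return -distance - max_distance
    # No-throw branch: the distance itself.
    return distance


if __name__ == "__main__":
    print(reward(True, 2.0, 10.0))   # throw branch
    print(reward(False, 2.0, 10.0))  # no-throw branch
```

With these inputs the throw branch gives -12.0 and the no-throw branch gives 2.0, so under this reading the two branches never overlap as long as `distance` stays within `[0, max_distance]`.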

Do we need a differentiable reward function instead?
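
If a differentiable form turns out to be needed, one possible relaxation is to replace the boolean `throw` with a value in [0, 1] and blend the two branches linearly. This is only a sketch of one option, not a decision for the issue: `soft_reward`, `throw_prob`, and `max_distance` are hypothetical names, and `max_distance` again assumes that `max(distance)` is a fixed upper bound.

```python
def soft_reward(throw_prob: float, distance: float, max_distance: float) -> float:
    """Linear blend of the two branches of the piecewise reward.

    Assumptions (hypothetical, for illustration only):
      - `throw_prob` in [0, 1] stands in for the boolean `throw`.
      - `max_distance` is a fixed upper bound on `distance`.

    The result is linear in `throw_prob` and in `distance`, so it is
    differentiable with respect to both.
    """
    if not 0.0 <= throw_prob <= 1.0:
        raise ValueError("throw_prob must lie in [0, 1]")
    throw_branch = -distance - max_distance
    no_throw_branch = distance
    return throw_prob * throw_branch + (1.0 - throw_prob) * no_throw_branch


if __name__ == "__main__":
    # The endpoints reproduce the two branches of the piecewise version.
    print(soft_reward(1.0, 2.0, 10.0))
    print(soft_reward(0.0, 2.0, 10.0))
    print(soft_reward(0.5, 2.0, 10.0))
```

At `throw_prob` of 1.0 and 0.0 this matches the two original branches, and values in between interpolate between them.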