Create reward function for RL
Closed this issue · 0 comments
bcaine commented
Potentially:
if throw:
return - distance - max(distance)
else:
return distance
Do we need a differentiable one instead?
Closed this issue · 0 comments
Potentially:
if throw:
return - distance - max(distance)
else:
return distance
Do we need a differentiable one instead?