Will anyone want to try RL? How about Eureka method for generating rewards?

Question

Opened this issue 3 months ago · 0 comments

Maybe we can use Eureka-like (https://eureka-research.github.io/) method to generate the reward function? Thanks!