google-research/deep_ope

Evaluation Script

zchuning opened this issue · 0 comments

Hi,

Could you release the evaluation script for the benchmark? In particular, it would be very helpful to know which policy seeds/checkpoints are used for each evaluation metric.