google-research/deep_ope

Hyperparameters for the baseline results

shlee94 opened this issue · 1 comments

Hello,

I believe this is the github repo for the paper "Benchmarks for Deep Off-Policy Evaluation".

Do you have any plans to release the hyperparameters & setups used for baselines results?

And possibly implementation of the baseline methods considered in the paper?

Thank you!

Also, it seems they don't have APIs that provide trajectories collected from behavior policy as stated on page 3 in their paper. I find it really difficult to use this benchmark to reproduce the result in their paper.