Hyperparameters for the baseline results

Question

Hyperparameters for the baseline results

shlee94 opened this issue 4 years ago · 1 comments

shlee94 commented 4 years ago

Hello,

I believe this is the github repo for the paper "Benchmarks for Deep Off-Policy Evaluation".

Do you have any plans to release the hyperparameters & setups used for baselines results?

And possibly implementation of the baseline methods considered in the paper?

Thank you!

Answer 1 · 2023-08-15T22:26:51.000Z

Also, it seems they don't have APIs that provide trajectories collected from behavior policy as stated on page 3 in their paper. I find it really difficult to use this benchmark to reproduce the result in their paper.