Hyperparameters for the baseline results
shlee94 opened this issue · 1 comment
shlee94 commented
Hello,
I believe this is the GitHub repo for the paper "Benchmarks for Deep Off-Policy Evaluation".
Do you have any plans to release the hyperparameters and setups used for the baseline results?
And possibly implementations of the baseline methods considered in the paper?
Thank you!
KenCao2007 commented
Also, it seems the repo doesn't provide the APIs for trajectories collected from the behavior policy that are described on page 3 of the paper. This makes it really difficult to use this benchmark to reproduce the results in the paper.