aviralkumar2907/CQL

About the derivation in paper

2019ChenGong opened this issue · 1 comments

About the derivation in paper

Thanks for your excellent work!

We have a question in the paper, "Conservative Q-Learning
for Offline Reinforcement Learning", about the proof of Theorem 3.2. In the equation,
.

Why can we know that ?

Thank you!