Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
Primary LanguagePythonMIT LicenseMIT