[ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regularization method, in Pytorch
Primary LanguagePythonMIT LicenseMIT
No issues in this repository yet.