ltlhuuu/A2PR

[ICML 2024] Implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regularization method, in Pytorch

PythonMIT

Watchers

drkostas
University of Tennessee, Knoxville
ltlhuuu
National University of Defense Technology
MaXiaoTianGitHub