/APO

Reproducing the paper: Average-Reward Reinforcement Learning with Trust Region Methods

Primary LanguagePythonMIT LicenseMIT

Stargazers