/Increasing-the-Action-Gap-RL-pytorch

Pytorch implementation of the Persistent Advantage reinforcement learning operator proposed in paper 'Increasing the Action Gap: New Operators for Reinforcement Learning'

Primary LanguagePython

Watchers