mahaitongdae/mpg
MPG is originated from the paper "Mixed policy gradient", which also contains a cluster of high-quality implementations of deep reinforcement learning algorithms.
Python
MPG is originated from the paper "Mixed policy gradient", which also contains a cluster of high-quality implementations of deep reinforcement learning algorithms.
Python