brain-research/mirage-rl-bpttv
Fork of https://github.com/wgrathwohl/BackpropThroughTheVoidRL with modifications for the paper "The Mirage of Action-Dependent Baselines in Reinforcement Learning".
PythonMIT
Fork of https://github.com/wgrathwohl/BackpropThroughTheVoidRL with modifications for the paper "The Mirage of Action-Dependent Baselines in Reinforcement Learning".
PythonMIT