/reinforcement_learning_v_mpo

Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)

Primary LanguagePython

No issues in this repository yet.