This repository contains all code and experiments for the Trust Region Competitive Policy Optimization (TRCoPO) algorithm.
Primary language: Jupyter Notebook. License: MIT.