/RL_toolbox

reinfore learning tool box, contains trpo, a3c algorithm for continous action space

Primary LanguagePythonMIT LicenseMIT

Stargazers