/cma-es-reinforcement-learning

CMA-ES-based high-confidence policy improvement for RL

Primary Language: Python
