/DRL-2018

Experiments on combining Policy Gradient methods (vanilla PG, Actor-Critic, PPO) with Evolution Strategies.

Primary LanguagePython

Stargazers