/drl-policy-gradient-cartpole

PyTorch implementations of policy gradient reinforcement learning algorithms on the OpenAI Gym CartPole environment: REINFORCE, Actor-Critic, A2C, and A3C.

Primary language: Python
License: MIT
