/PolicyBased_DeepRL

Policy based Reinforcement Learning techniques with REINFORCE and Actor Critic, applied to OpenAI's gym environments.

Primary LanguagePython

Watchers