woithook/A2C-Pytorch-implementations

Implement the A2C(Advantage Actor-Critic) algorithm using pytorch in multiple environments of openai gym. (Including Cartpole, LunarLander, Pong. Breakout is tuning and maybe complete soon.) Sometime implement the REINFORCE algorithm as variations of A2C.

Python

Stargazers

toberain