woithook/A2C-Pytorch-implementations
Implementations of the A2C (Advantage Actor-Critic) algorithm in PyTorch for several OpenAI Gym environments: CartPole, LunarLander, and Pong. Breakout is still being tuned and may be completed soon. In some cases the REINFORCE algorithm is also implemented as a variation of A2C.
Python
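
The description treats REINFORCE as a variation of A2C. A minimal sketch of that relationship, in plain Python rather than PyTorch, is shown below. It is not taken from the repository: the function names `discounted_returns` and `policy_weights` are hypothetical, and it assumes full-episode Monte Carlo returns, whereas an A2C implementation may instead use bootstrapped n-step returns.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return the discounted return G_t for each step of one finished episode."""
    returns = []
    running = 0.0
    # Walk backwards so each G_t reuses G_{t+1}: G_t = r_t + gamma * G_{t+1}.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def policy_weights(returns, values=None):
    """Per-step weights applied to log pi(a_t | s_t) in the policy loss.

    Assumption for illustration only:
    - values=None gives plain REINFORCE, where the weight is G_t.
    - With critic estimates V(s_t), the weight is the advantage G_t - V(s_t),
      which is the actor-critic form.
    """
    if values is None:
        return list(returns)
    return [g - v for g, v in zip(returns, values)]


if __name__ == "__main__":
    episode_returns = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)
    print(episode_returns)                                   # REINFORCE weights
    print(policy_weights(episode_returns, [1.0, 1.0, 1.0]))  # advantage weights
```

Under these assumptions the two methods differ only in whether a critic's value estimate is subtracted as a baseline, which is why one code path can serve both.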