Implementation of A3C (Asynchronous Advantage Actor-Critic)

This is a TensorFlow implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm, using a CNN-LSTM as the function approximator.
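As a rough illustration of the "advantage" part of A3C, the sketch below computes n-step discounted returns for a short rollout and subtracts the critic's value estimates to get advantages. It is a minimal, framework-free sketch and is not taken from this repository's code: the function names `n_step_returns` and `advantages`, the `gamma` default, and the sample numbers are all assumptions made for the example.

```python
def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Compute discounted returns R_t = r_t + gamma * R_{t+1} for one rollout.

    `bootstrap_value` is the critic's value estimate for the state that
    follows the last step (use 0.0 if the episode ended there).
    Hypothetical helper for illustration only.
    """
    returns = []
    running = bootstrap_value
    # Walk the rollout backwards so each return reuses the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(returns, values):
    """Advantage A_t = R_t - V(s_t): how much better the rollout did than
    the critic predicted. Hypothetical helper for illustration only."""
    return [r - v for r, v in zip(returns, values)]


if __name__ == "__main__":
    rewards = [1.0, 0.0, 1.0]   # made-up rewards from a 3-step rollout
    values = [1.0, 0.5, 0.5]    # made-up critic estimates V(s_t)
    rets = n_step_returns(rewards, bootstrap_value=0.0, gamma=0.5)
    print(rets)
    print(advantages(rets, values))
```

In an actor-critic setup, the advantages typically weight the policy-gradient term for the actor, while the returns serve as regression targets for the critic.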

Original Paper

Training on Breakout-v0 took 28 hours on an Nvidia GeForce GTX 1070 GPU.

To train the agent, run:

$ python3 trainer.py

To watch a demo of the trained agent, run:

$ python3 play.py

Got important help from this project