/-

An implementation of the actor-critic famous algorithm in a distributed way. Based on the following paper: Asynchronous Methods for Deep Reinforcement Learning

Primary LanguagePython

Implementation of the a3c (actor-critic) algorithm on several processes.

Based on the following paper: Asynchronous Methods for Deep Reinforcement Learning.

Comparsion over optimizers and types of network architectures.

alt text

Prerequisites

gym~=0.18.0
numpy~=1.18.1
matplotlib~=3.2.2