DRL-ManyTor

A performance comparison between the Proximal Policy Optimization (PPO) and Asynchronous Advantage Actor-Critic (A3C) algorithms.

Primary language: Python. License: MIT.
