/blackoak

Reinforcement Learning through Proximal Policy Optimization in Tensorflow 2.2.0

Primary LanguageJupyter Notebook

blackoak

Reinforcement Learning through Proximal Policy Optimization in Tensorflow 2.0, using Generalized Advantage Estimation.