/ProximalPolicyOptimizationKeras

This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.

Primary LanguagePython

Watchers