/simple-A2C-PPO

Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.

Primary LanguageJupyter Notebook