jamesliu/nanoPPO

An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention policy for reinforcement learning.

PythonApache-2.0

Stargazers