/stabilized-rl

On-policy RL stabilized with a KL loss

Primary Language: C++
