/reinforcement_learning

My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.

Primary LanguageJupyter NotebookMIT LicenseMIT

Stargazers