/reinforcement_learning_ppo_rnd

Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Watchers