Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
Primary LanguagePython