Unofficial Pytroch (1.7+) implementation of the original Deep Reinforcement Learning Course
π ARTICLE // DOOM IMPLEMENTATION
π ARTICLE // CARTPOLE IMPLEMENTATION // DOOM IMPLEMENTATION
π ARTICLE
π ARTICLE
π ARTICLE
π¨βπ» A trained RND agent that learned to play Montezuma's revenge (21 hours of training with a Tesla K80
If you have any questions on theory and Tensorflow implementation, please contact the original author:
π§: simonini.thomas.pro@gmail.com
Github: https://github.com/simoninithomas/Deep_reinforcement_learning_Course