/acrobot

[in-dev] Basic replication of Deep Q Learning on the Acrobot environment, as desribed in Mnih et al.'s "Human-level control through deep reinforcement learning."

Primary LanguagePython

Deep Q Learning Replication

In collaboration with Warren Sunada-Wong.

This repository replicates Deep Q Learning as described in Mnih et al.'s "Human-level control through deep reinforcement learning" on the Acrobot environment from OpenAI's Gym.

Citation

@article{mnih2015human,
  title={Human-level control through deep reinforcement learning},
  author={Mnih, Volodymyr and Kavukcuoglu, Koray and Silver, David and Rusu, Andrei A and Veness, Joel and Bellemare, Marc G and Graves, Alex and Riedmiller, Martin and Fidjeland, Andreas K and Ostrovski, Georg and others},
  journal={nature},
  volume={518},
  number={7540},
  pages={529--533},
  year={2015},
  publisher={Nature Publishing Group}
}