Udacity Deep Reinforcement Learning OpenAI Gym LunarLander-v2 project. Original code and project details can be found here.
The baseline agent is a Deep Q-Network with Experience Replay and Fixed Q-Targets. More details found in this paper.
This agent is the same as the baseline agent in every way except Agent.learn now uses the Double Deep Q-Network algorithm.
This agent is the same as the baseline agent with prioritized experience replay added. (Ran too slow, need faster priority replay buffer)