Deep Q-Network reinforcement learning model implemented for the cartpole environment
Tested on acrobot v1 and averages -90 return after 300 episodes
Deep Q-Network reinforcement learning model implemented for the cartpole environment
Jupyter NotebookMIT