BoxingV0-StableBaselines
An implementation of a DQN trained on Openai Gym's Boxing-ram-v0 using Stable Baselines.
The agent is playing the white character below:
Performance
It starts beating the built-in AI reliably after around the 70,000th episode.
Unfortunately, training got really unreliable (probably because of the vanilla DQN architecture) and performance varied a lot from episode to episode.
The highest-performing agent (saved on episode 149,000) achieved an average score of 40.54 over 37 test episodes.
Training
Want to train your own agent?
git clone
this repository and run the ipython file here.
Alternatively:
- Click here to access the colab file directly.
- Click
File
->Open in Playground
orSave copy to Drive
->Run All
.
Credits
Credit goes to StarAI for Colab visualization code. Check out the Stable Baselines library for awesome reinforcement learning algorithms!