
TensorFlow implementation of Human-Level Control through Deep Reinforcement Learning.


DQN

Prerequisites

Training

python main.py

Training for the full 200 epochs takes more than 3 days; you can stop it early. The training records are written to breakout.csv.
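To decide when to stop early, it can help to look at the best record logged so far. The sketch below is only illustrative: it assumes breakout.csv has a header row, and the column name `avg_reward` used in the usage line is hypothetical, so substitute whatever column the file actually contains.

```python
import csv


def best_row(path, column):
    """Return the row of a CSV file with the largest value in `column`.

    Assumes the file has a header row; returns None if no row has a value.
    """
    with open(path, newline="") as f:
        rows = list(csv.DictReader(f))
    # Skip rows where the column is missing or empty.
    scored = [r for r in rows if r.get(column) not in (None, "")]
    if not scored:
        return None
    return max(scored, key=lambda r: float(r[column]))
```

For example, `best_row("breakout.csv", "avg_reward")` would return the record with the highest value in that (assumed) column.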

Testing

python main.py --train=False

Saving the best result

python main.py --train=False --save=True

The best result is saved in the best_result directory; ffmpeg can then be used to turn it into a video.
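As a rough sketch of the ffmpeg step, the helper below builds an ffmpeg command line for encoding numbered image frames as an MP4. The frame naming pattern (`%05d.png`), the frame rate, and the output file name are all assumptions, since the layout of best_result is not described here; adjust them to match the files actually written.

```python
from pathlib import Path


def build_ffmpeg_command(frame_dir="best_result", pattern="%05d.png",
                         fps=30, output="best_result.mp4"):
    """Return an ffmpeg argument list that encodes numbered frames as MP4.

    The pattern, fps, and output defaults are assumptions, not values
    taken from this repository.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-framerate", str(fps),  # rate at which input frames are read
        "-i", str(Path(frame_dir) / pattern),
        "-c:v", "libx264",       # encode as H.264
        "-pix_fmt", "yuv420p",   # pixel format most players accept
        output,
    ]


if __name__ == "__main__":
    # Print the command so it can be inspected before running it.
    print(" ".join(build_ffmpeg_command()))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` once ffmpeg is installed and the pattern matches the saved frames.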

Best result

best_result

References

License

MIT License