Implemented from my DQN framework: https://github.com/romainducrocq/DQN-frameworQ
The boosted seamonkai flapping around.
- Train:
python3 train.py -algo PerDuelingDoubleDQNAgent -max_total_steps 9000000
- Observe:
python3 observe.py -d ./save/PerDuelingDoubleDQNAgent_lr0.001_model.pack
- Visualize:
tensorboard --logdir ./logs/train/
- Play:
python3 play.py
- Make:
cd bin/ && bash make.sh
- Apt packages:
apt-get update && apt-get install build-essential libpq-dev libssl-dev openssl libffi-dev sqlite3 libsqlite3-dev libbz2-dev zlib1g-dev cmake
- Python 3.7.m:
m=0 && while wget -q --method=HEAD https://www.python.org/ftp/python/3.7.$(( $m + 1 ))/Python-3.7.$(( $m + 1 )).tar.xz; do m=$(( $m + 1 )); done && wget https://www.python.org/ftp/python/3.7.$m/Python-3.7.$m.tar.xz && tar xvf Python-3.7.$m.tar.xz && cd Python-3.7.$m && ./configure && make && make altinstall && cd .. && rm -rv Python-3.7.$m.tar.xz Python-3.7.$m
- Venv (venv):
mkdir venv && python3.7 -m venv venv/
source venv/bin/activate
(venv) ... Pip3 packages
deactivate
- Pip3 packages:
(venv) export TMPDIR='/var/tmp'
(venv) pip3 install 'pyglet==1.5.0' gym torch tensorboard 'msgpack==1.0.2' wheel --no-cache-dir
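- Python 3.7.m, probe step only (sketch):
The Python 3.7.m one-liner above finds the latest patch release by sending HEAD requests for 3.7.1, 3.7.2, ... until one fails, then keeps the last m that succeeded. The sketch below pulls that loop out so it can be tried without downloading anything. The names latest_patch, probe_wget and probe_fake are hypothetical helpers, not part of this repo, and probe_fake is an offline stand-in that simply pretends releases 3.7.1 to 3.7.3 exist.

```shell
#!/usr/bin/env bash

# Hypothetical helper: print the highest patch number m such that the
# probe command succeeds for every release 3.7.1 .. 3.7.m.
# $1 is the name of a command that takes a URL and returns 0 if it exists.
latest_patch() {
    local probe="$1" m=0
    while "$probe" "https://www.python.org/ftp/python/3.7.$((m + 1))/Python-3.7.$((m + 1)).tar.xz"; do
        m=$((m + 1))
    done
    echo "$m"
}

# Network probe: the same HEAD request the one-liner issues.
probe_wget() { wget -q --method=HEAD "$1"; }

# Offline stand-in (assumption, for dry runs only):
# pretends that patch releases 1, 2 and 3 exist.
probe_fake() {
    case "$1" in
        */3.7.[1-3]/*) return 0 ;;
        *) return 1 ;;
    esac
}

# Dry run, no network access needed.
latest_patch probe_fake
```

To probe python.org for real, something like m=$(latest_patch probe_wget) should give the same m as the one-liner, after which the download and build commands can be run unchanged.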