This repository contains an implementation of DQN with Hindsight Experience Replay (HER). The implementation is tested on the bit-flipping toy problem presented in the HER paper. Here is a blog post about HER.
The hyperparameters used in this repo are the same as those in the paper.
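To make the idea concrete, here is a minimal sketch of the bit-flipping task and of hindsight relabeling. It is not the code in this repo: the names `flip`, `reward`, `rollout`, and `relabel_final` are hypothetical, and the sketch assumes the "final" relabeling strategy, where the last state reached in an episode is substituted as the goal.

```python
from typing import List, Tuple

Bits = Tuple[int, ...]
# Assumed transition layout: (state, action, reward, next_state, goal)
Transition = Tuple[Bits, int, float, Bits, Bits]


def flip(state: Bits, action: int) -> Bits:
    """Return a copy of `state` with the bit at index `action` flipped."""
    return state[:action] + (1 - state[action],) + state[action + 1:]


def reward(state: Bits, goal: Bits) -> float:
    """Sparse reward: 0 when the goal is reached, -1 otherwise."""
    return 0.0 if state == goal else -1.0


def rollout(start: Bits, goal: Bits, actions: List[int]) -> List[Transition]:
    """Apply `actions` in order and record one transition per step."""
    episode: List[Transition] = []
    state = start
    for action in actions:
        next_state = flip(state, action)
        episode.append((state, action, reward(next_state, goal), next_state, goal))
        state = next_state
    return episode


def relabel_final(episode: List[Transition]) -> List[Transition]:
    """Hindsight copy of the episode: treat the final state as the goal."""
    achieved = episode[-1][3]
    return [(s, a, reward(s2, achieved), s2, achieved) for s, a, _, s2, _ in episode]


if __name__ == "__main__":
    # A failed episode: the goal (1, 1, 1) is never reached.
    episode = rollout((0, 0, 0), (1, 1, 1), [0, 1])
    # Store both the original and the relabeled transitions.
    replay_buffer = episode + relabel_final(episode)
    for transition in replay_buffer:
        print(transition)
```

Every original transition in this episode carries a reward of -1, so it gives the Q-network little to learn from. The relabeled copy ends with a reward of 0, which is the learning signal HER adds.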
    python train.py --help

    usage: train.py [-h] [-v] [-s S] [-i I] [-e E] [-c C] [-o O]

    HER Bit Flipping

    optional arguments:
      -h, --help  show this help message and exit
      -v          Verbose flag
      -s S        Size of bit string
      -i I        Num epochs
      -e E        Num episodes
      -c C        Num cycles
      -o O        Optimization steps
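For reference, a parser that produces the usage line above could look roughly like the sketch below. This is a reconstruction from the help text, not the actual contents of train.py: the `build_parser` name and the `int` argument types are assumptions, no defaults are set because the help text does not show any, and the values `8` and `40` in the usage example are purely illustrative.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser whose flags mirror the help output shown above."""
    parser = argparse.ArgumentParser(prog="train.py", description="HER Bit Flipping")
    parser.add_argument("-v", action="store_true", help="Verbose flag")
    # Integer types are an assumption; the help text does not state them.
    parser.add_argument("-s", type=int, help="Size of bit string")
    parser.add_argument("-i", type=int, help="Num epochs")
    parser.add_argument("-e", type=int, help="Num episodes")
    parser.add_argument("-c", type=int, help="Num cycles")
    parser.add_argument("-o", type=int, help="Optimization steps")
    return parser


if __name__ == "__main__":
    # Illustrative invocation: verbose, 8-bit strings, 40 optimization steps.
    args = build_parser().parse_args(["-v", "-s", "8", "-o", "40"])
    print(args)
```

Flags that are omitted on the command line come back as `None` in this sketch, so the training code would need to substitute its own defaults.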
TODO
TODO