An unofficial implementation of Google Brain's research in 2018 using tensorflow 1.15.0. Instead of using PPO, we use basic REINFORCE policy gradient algorithm with involving creative idea : depressed feedback.
numpy
tensorflow 1.15.0
keras
PIL
matplotlib
child_net.py
: containing python class representing child network. Can be replaced by any classifer as long as it is a keras model.controller.py
: containing python class representing the RNN controller. Implemented intensorflow 1.x
.data_iterator.py
: containing python class to loadcifar10
dataset. If policy is given, it will automatically apply image operations.run.py
: code to run.transformations.py
: contains functions of image transformations. 16 in total.
python3 run.py