Investigation of generative adversarial network training: The effect of hyperparameters on training time and stability
Bachelor Degree Project in Information Technology
IT613G, University of Skövde
Alexander Gustafsson, Jonatan Linberg
Thesis Paper
Generative Adversarial Networks (GAN) is a technique used to learn the distribution of some dataset in order to generate similar data. GAN models are notoriously difficult to train, which has caused limited deployment in the industry. The results of this study can be used to accelerate the process of making GANs production ready.
An experiment was conducted where multiple GAN models were trained, with the hyperparameters Leaky ReLU alpha, convolutional filters, learning rate and batch size as independent variables. A Mann-Whitney U-test was used to compare the training time and training stability of each model to the others’.
Except for the Leaky ReLU alpha, changes to the investigated hyperparameters had a significant effect on the training time and stability. This study is limited to a few hyperparameters and values, a single dataset and few data points, further research in the area could look at the generalisability of the results or investigate more hyperparameters.
- The project is built around the EMNIST dataset, available in python-mnist or from:
- The initial cGAN and FID code (train_cgan.py) was developed by Jason Brownlee, available from:
Training a cGAN can be accomplished by running the train_cgan code, python train_cgan.py <save_path>
. The user can enter runtime parameters when prompted or load them from a file, python train_cgan.py <save_path> <RTP file>
. Alternatively, one can run a multi-model experiment using the run_experience.py-script, however the variables in the script will need to be adjusted for each experiment.
Raw data from the experiment is available in the final experiment folder, grouped by EXMO (00-28) and run (0-2). The loss, accuracy and FID measurements are in a csv plain text file and are most easily viewed with the result plotting tool.
Derived and calculated data (including a Mann-Whitney U-test) was generated with u_test and calc_experiment_data and can be found as csv-files in the data folder. The csv-files have columns for measurements in epochs, these are empty in the csv-files, but can easily be calculated by dividing the batch measurement with 881 (1762 for EXMO27 and 440 for EXMO28).
Latent Space Explorer and Recorder (LaSER.py)
The program features one slider for each latent dimension, for exploration the latent space. The user can define a vector in the space to traverse and even create an animation.
python LaSER.py
and enter the model file when prompted orpython LaSER.py <model.h5>
Character Generator (char_gen.py)
The program can be used to generate characters and text using a generator model. A full list of the many features can be found by using the "-H" option. Can also evaluate the euclidean distance between the generated images and present them as a box plot.
python char_gen.py -[options]
Result plotting (plot_results.py)
The program can present various graphs of the data collected from a training run. The "window size" prompt refers to the size (in batches) of the window used for a rolling average over the data.
python plot_results.py <results_csv file>
░▒▒▒░
▒▓▓▓▓▒▒░ ░▒▒░
░▓▓▓▓▓▓▓▒░ ░░░ ▒▓▓░ ░░░░░
▒▒▓▓▓▓▓▓▓▓▒ ░▒▒ ░▒▓▓░ ░▒▒▒▒▒▒░
▒▓▓▓▓▒▒▒▓▓▓▓░ ▒▓▓░ ░▒▓▒░ ░░▒▒▓▓▓▓▓▒░
▒▓▓▓░ ░▓▓▓▒ ░▓▓▒░ ░▓▓░ ░▒▓▓▓▓▓▓▓▓▒░
░▓▓▓░ ░▓▓▓▒ ░▒▓▓▒░ ▒▓▓░ ░▒▓▓▓▓▓▓▓▓▓▓░
▒▓▓▒░ ░▓▓▓▒ ▒▒▓▓▒░ ░▒▓▓▒ ░▒▓▓▓▒▒▒▓▓▓▓▒
▒▓▓▒ ░▒▓▓▓▒ ▒▓▓▓▒ ▒▓▓▓░ ░▒▓▓▓▒░░▒▓▓▓▓▒
▒▓▓░ ▒▓▓▓▒░ ░▒▓▓▓░░▒▓▓▓ ░▒▓▓▓░░░▒▓▓▓▓▒
░▒▓▓░ ░▒▓▓▓▓▒ ░▒▓▓▒▓▓▓▓▒ ░▒▓▓▓▓▒░▒▓▓▓▓▓░
░▓▓▒░░▒▒▓▓▓▓▒ ░▓▓▓▓▓▓▓▒ ░▒▓▓▓▓▓▓▓▓▓▓▓▓▒░
▒▓▓░ ░▒▓▓▓▓▓▒░ ░▒▓▓▓▓▓▓░ ░▒▓▓▓▓▓▓▓▓▓▓▓▒░
░▒▓▓░░▒▓▓▓▓▓▓▓▒▒░ ▒▓▓▓▓▓░ ░▒▓▓▓▓▓▓▓▓▓▒░
▒▓▓▒▒▓▓▓▓▓▓▓▓▓▓▒░ ░░▓▓▓▒ ░▒▓▓▓▓▓▓▓▒░░
░▒▓▓▓▓▓▓▓▒▒▒▒▒▓▓▓▒░ ░▓▓▓▒ ░▒▓▓▓▓▓▓▒░░
░▓▓▓▓▓▒▒░░ ░▒▓▓▓░ ▒▓▓▓▒ ░▒▓▓▓▓▓▒░░ ░░░
▒▓▓▓▓▒░ ░▒▓▓▓▒ ▒▓▓▓▒ ░▒▓▓▓▓▓▓▒▒░░░░ ░▒▒▒░
▒▓▓▓▓▒ ▒▓▓▓▒▒ ▒▓▓▓░ ░▒▒▓▓▓▓▓▓▓▒▒▒▒▒▒▒▒▓▓▒
▒▓▓▓▒░░ ░▒▓▓▓▓▒░ ▒▓▓▒░ ░▒▒▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▒░
▒▓▓▓▓▓▒▒▒▒▒▒▓▓▓▓▒░ ▒▓▓▒░ ░▒▒▓▓▓▓▓▓▓▓▓▓▓▓▒▒░
░▒▓▓▓▓▓▓▓▓▓▓▓▓▓▒░ ▒▓▓▒ ░░▒▒▒▒▒▒▓▒▓▓▓▒░░
░░▒▓▓▓▓▓▓▓▓▓▓▓▒░░ ▒▒▒░ ░░░▒▒░▒▒▒▒░░
░░░░░░░░░░░░ ░░ ░░░