Procedural Generation of 3D Worlds

This repository contains Generative Adversarial Network (GAN) and style transfer code that can be used to generate 3D worlds. See this demo to explore some of the generated worlds, and this blog post for more information about the project.

GAN Versions

Each version of the GAN introduces incremental updates that change how the GAN works. Each version produces quite varied landscapes due to its different hyperparameters and network configuration; thus, each version is kept separately (not just in the git log) so that anyone can use any of the GAN networks easily, at any time. The following list describes the updates at each iteration (in lieu of a git log ;) ), and a short sketch of the recurring techniques follows the list:

  • v0: a basic GAN
  • v2.1: uses leaky ReLU activations in the discriminator network instead of regular ReLU and passes a learning rate to the Adam optimizer
  • v3: uses resize upscaling to remove checkerboard artifacts (as outlined in the article "Deconvolution and Checkerboard Artifacts")
  • v4: uses two different learning rates for the discriminator and generator networks
  • v4.1: allows the GAN to train for an infinite number of iterations
  • v4.2: uses input images of 256x256px
  • v4.3: changes all tf.nn.relu activations in the generator to tf.nn.leaky_relu and scales the input image values to between 1 and 2 (instead of -1 and 1) to prevent values from getting stuck at zero
  • v5: returns to transposed-convolution upscaling with a 5x5px filter size (to see whether the checkerboard artifacts disappear in this case too)
  • v6: updates the filter size to 4x4 while keeping the different discriminator and generator learning rates
  • v6.1: allows for variable filter size
  • v7.0: returns to image resize upscaling, and includes the variable filter size
  • v8.0: decreases the number of convolutional layers to reduce the chance of overfitting
  • v9.0: changes the prediction probabilities slightly
  • v10.0: fixes a discriminator loss bug by resetting the input images to begin from the start once they have all been fed into the GAN (and reverts the prediction probabilities to their pre-v9.0 values)
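
Several of these techniques recur across versions. The snippet below is a minimal TensorFlow/Keras sketch (not the repository's actual code) of three of them: resize upscaling (v3/v7), leaky ReLU activations (v2.1/v4.3), and separate discriminator/generator learning rates (v4). The layer sizes and learning rates are illustrative assumptions.

```python
import tensorflow as tf

def upsample_block(x, filters, kernel_size=4):
    # Resize-then-convolve upscaling (v3/v7) avoids the checkerboard
    # artifacts that stride-2 transposed convolutions tend to introduce.
    height, width = x.shape[1], x.shape[2]
    x = tf.image.resize(x, (height * 2, width * 2), method="nearest")
    x = tf.keras.layers.Conv2D(filters, kernel_size, padding="same")(x)
    # Leaky ReLU (v2.1/v4.3) keeps a small gradient for negative inputs,
    # which helps prevent units from getting stuck at zero.
    return tf.nn.leaky_relu(x, alpha=0.2)

# Example: double a 64x64 feature map to 128x128.
inputs = tf.keras.Input(shape=(64, 64, 128))
model = tf.keras.Model(inputs, upsample_block(inputs, filters=64))

# Separate optimizers let the discriminator and generator learn at
# different rates (v4); the values here are placeholders.
disc_optimizer = tf.keras.optimizers.Adam(learning_rate=2e-4)
gen_optimizer = tf.keras.optimizers.Adam(learning_rate=1e-4)
```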

Style Transfer Network Files

  • style.py takes in the input arguments and parses them to ensure that all arguments are valid and all file paths exist. It then hands training off to optimize.py and evaluation to evaluate.py.
  • optimize.py takes in a range of arguments and trains the model to produce the most accurate output image it can from the given style and content images. It has been altered to take in old models and checkpoints and resume training from that point. It uses vgg.py to extract content and style features.
  • vgg.py loads the VGG19 network and runs images through it to extract both content and style features (a minimal sketch of this follows the list).
  • transform.py holds the actual model being trained. It has been altered to upsample through image resizing rather than through transposed convolutions.
  • evaluate.py creates the final image based on the trained models. It takes the trained model and passes in the new input image to output the stylized image.
  • utils.py provides the file-loading and file-saving helpers shared by the other scripts.
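
For reference, the sketch below shows the kind of feature extraction vgg.py performs, using Keras' pretrained VGG19 rather than the repository's own loader. The chosen layers follow a common style-transfer convention and are assumptions, not necessarily the repository's configuration.

```python
import tensorflow as tf

# Pretrained VGG19 used as a fixed feature extractor (its weights stay frozen).
vgg = tf.keras.applications.VGG19(include_top=False, weights="imagenet")
vgg.trainable = False

content_layer = "block4_conv2"
style_layers = ["block1_conv1", "block2_conv1", "block3_conv1"]
extractor = tf.keras.Model(
    vgg.input,
    [vgg.get_layer(content_layer).output]
    + [vgg.get_layer(name).output for name in style_layers],
)

def gram_matrix(features):
    # Style is compared via Gram matrices: channel-wise correlations of the
    # feature maps that capture texture while discarding spatial layout.
    result = tf.linalg.einsum("bijc,bijd->bcd", features, features)
    num_locations = tf.cast(tf.shape(features)[1] * tf.shape(features)[2],
                            tf.float32)
    return result / num_locations
```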

Useful Scripts

This folder contains scripts to help create a clean dataset:

  • delDarkUnvariedIdentical.py: deletes images that are below a certain lightness threshold or have low variation (e.g., all white or all black), and places images that are identical to other images into a folder called "duplicates" (a sketch of this filtering follows the list)
  • prepDataset.py: pickles images, preparing them to be input images for the GAN
  • scrapeMaps.py: retrieves map data automatically from the web using Selenium
  • manageSavedMaps.py: unzips maps and moves them to the correct folder. It deletes all other downloaded map-like files to save space
  • scaleToWidthxWidth: scales images to a given square size (width x width)
  • heightmap2stl.jar: converts a greyscale PNG image to a 3D STL file (origin), which is useful for prototyping before bringing the height map into Unity 3D as a terrain
    • use this command to run the file: java -jar heightmap2stl.jar 'path to imagefile' 'height of model' 'height of base'
      • usually a model height of '100' and a base height of '2' work quite well (and yes, include the quotes in the command)
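
As a rough illustration of the dataset cleaning, the sketch below mimics what delDarkUnvariedIdentical.py does. The thresholds, the PNG-only assumption, and the hash-based duplicate check are illustrative choices, not the script's exact logic.

```python
import hashlib
import os
import shutil

import numpy as np
from PIL import Image

def clean_dataset(folder, dark_threshold=40.0, variance_threshold=5.0):
    """Delete dark or unvaried images; move exact duplicates aside."""
    duplicates_dir = os.path.join(folder, "duplicates")
    os.makedirs(duplicates_dir, exist_ok=True)
    seen = {}
    for name in sorted(os.listdir(folder)):
        path = os.path.join(folder, name)
        if not name.lower().endswith(".png"):  # assumes PNG height maps
            continue
        pixels = np.asarray(Image.open(path).convert("L"), dtype=np.float32)
        # Too dark or too uniform (e.g., all white or all black): delete.
        if pixels.mean() < dark_threshold or pixels.std() < variance_threshold:
            os.remove(path)
            continue
        # Identical to an already-kept image: move it to "duplicates".
        digest = hashlib.md5(pixels.tobytes()).hexdigest()
        if digest in seen:
            shutil.move(path, os.path.join(duplicates_dir, name))
        else:
            seen[digest] = path
```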

Example Images

Input Terrain Images

[Images: six height maps from Terrain.Party]

Generated Terrain Images

[Images: seven generated height maps]

Input Style Images

[Image: an input style image]

Generated, Styled Images

[Images: eight generated, styled images]

Generated 3D Models

[Images: eight 3D models created from the height maps and style images]