This an implementation of Toward Multimodal Image-to-Image Translation.
- pytorch 1.0
- numpy, Pillow, opencv
- Download
edges2shoes
dataset from here. - Edit the path in
train.py
file and run it.
You can also download my trained models from here (run01.tar.gz).
You can try them using inference/test.ipynb
.
- For the generator I use resnet-like architecture not unet (like they do in the original paper).
- I insert style information into the generator using AdaIN layers (like in StyleGAN).
- I feed into the generator not only edges but also a binary mask.
- I also use the binary mask to mask the outputs of the discriminators.
This code is inspired by