/layout2img

Context-Aware Layout to Image Generation with Enhanced Object Appearance

Primary LanguagePython

layout2img

This repository includes the implementation for Context-Aware Layout to Image Generation with Enhanced Object Appearance (to appear in CVPR 2021).

This repo is not completely.

Network Structure

network_structure

Requirements

  • python3
  • pytorch >1.0
  • numpy
  • matplotlib
  • opencv

Or install full requirements by running:

pip install -r requirements.txt

TODO

  • instruction to prepare dataset
  • remove all unnecessary files
  • add link to download our pre-trained model
  • clean code including comments
  • instruction for training
  • instruction for evaluation
  • instruction for applying our methods in layout2img

Training ImageTransformer

Data Preparation

Download COCO dataset to datasets/coco

bash scripts/download_coco.sh

Download VG dataset to datasets/vg

bash scripts/download_vg.sh
python scripts/preprocess_vg.py

Start training

See opts.py for the options. (You can download the pretrained models from here

Evaluation

Trained model

you can download our trained model from our onedrive repo

Performance

You will get the scores close to below after training for around 200 epochs:

models Resolutions IS-COCO IS-VG FID-COCO FID-VG
Ours-ED 64*64 15.27+/-.25 8.53+/-.13 31.32 33.91
Ours-D 128*128 15.62+/-.05 12.69+/-.45 22.32 21.78

Reference

If you find this repo helpful, please consider citing:

@inproceedings{he2021context,
  title={Context-Aware Layout to Image Generation with Enhanced Object Appearance},
  author={He, Sen and Liao, Wentong and Yang, Michael and Yang, Yongxin and Song, Yi-Zhe and Rosenhahn, Bodo and Xiang, Tao},
  booktitle={CVPR},
  year={2021}
}

Acknowledgements

This repository is based on LostGAN, and the propsoed modules can be applied in the layout2img.