Code for the CVPR 2024 paper Text-guided Explorable Image Super-resolution.
This project is tested on a machine with
- OS: Ubuntu 22.04.4
- GPU: NVIDIA GeForce RTX 3090
Comprehensive list of packages used in the environment used for experiments are provided in requirements.txt
The codes are based on deep-floyd IF, karlo unCLIP, using Huggingface Diffusers. Three zero-shot methods DDNM, DPS, PiGDM are included for text guided super-resolution. We thank the authors and contributors of these repositories for making their code public!
- Imagen(DeepFloyd-IF) DDNM
- Imagen(DeepFloyd-IF) PiGDM
- Imagen(DeepFloyd-IF) DPS
- unCLIP DDNM.
We corrected a bug in our computation.The LR PSNRs using using Deepfloyd IF (Imagen) are now better than reported values in the paper.
As mentioned in the discussion section, even with high LR PSNR, results may not always be perceptually high quality. If the result is not satisfactory try running with different random seeds.
Download multimodal CelebA HQ to /data/work_data/multi_mod_celebahq
running command for testing 16x SR using DDNM with classifier free guidance
python run_dataset_imagen_superresolution.py --count 200 --scale 16 -g1 7 -g2 4 --run 3 --algo ddnm
running command for testing 16x SR using DPS with classifier free guidance
python run_dataset_imagen_superresolution.py --count 200 --scale 16 -g1 7 -g2 4 --algo dps --dps_scale 0.5 --dec_steps 250 --sr_steps 100 --start_time 100 --run 1
running command for testing 16x SR using PiGDM with classifier free guidance
python run_dataset_imagen_superresolution.py --count 200 --scale 16 -g1 7 -g2 4 --algo pigdm --dps_scale 0.5 --dec_steps 250 --sr_steps 100 --start_time 100 --run 1
Set -g1 1 and -g2 1 for disabling classifier-free guidance in both stages.
Set --scale 16 for 16x super-resolution.
Modify --dec_steps and --sr_steps to control the number of diffusion steps in the first stage and second stage.
If you find our workuseful for your research, please consider citing
@InProceedings{Gandikota_2024_CVPR,
author = {Gandikota, Kanchana Vaishnavi and Chandramouli, Paramanand},
title = {Text-guided Explorable Image Super-resolution},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2024},
pages = {25900-25911}
}