This is an repository that contains the resources for image-to-image translation (I2I) research.

Papers

WACV 2023

Panoptic-aware Image-to-Image Translation [pdf]
UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation [pdf] [PyTorch]

ICLR 2023

Multi-domain image generation and translation with identifiability guarantees [OpenReview] [No code]
Diffusion-based Image Translation using disentangled style and content representation [OpenReview] [PyTorch]
Dual Diffusion Implicit Bridges for Image-to-Image Translation [OpenReview] [PyTorch]

AAAI 2023

MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation [pdf] [PyTorch]
- a diffusion-based matching-and-generation framework that interleaves cross-domain matching and diffusion in the latent space for I2I

NeurIPS 2022

EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations [pdf] [code]
- an energy-guided stochastic differential equations that utilizes an energy function pretrained on source and target domains to guide the SDE inference for unpaired I2I
Unsupervised Image-to-Image Translation with Density Changing Regularization [pdf] [code]
- an unsupervised I2I model based on a density changing assumption that we should match image patches of high probability density for different domains.

ECCV 2022

Multi-Curve Translator for High-Resolution Photorealistic Image Translation [pdf] [code]
- a Multi-Curve Translator which predicts both the individual pixels and the neighor pixels for high-resolution I2I.
ManiFest: Manifold Deformation for Few-shot Image Translation [pdf] [code]
- a few-shot image translation model that learns a context-aware representation of a target domain using a style manifold between source and proxy anchor domains.
Vector Quantized Image-to-Image Translation [pdf] [code]
- A I2I framework based on vector quantized content representation
Unpaired Image Translation via Vector Symbolic Architectures [pdf] [code]
- a I2I framework based on Vector Symbolic Architectures which defines algebraic operations in a hypervector space.
VecGAN: Image-to-Image Translation with Interpretable Latent Directions [pdf] [NO CODE]
- a I2I framework with interpretable latent directions using latent space factorization and controllable strength of change.
Bi-level Feature Alignment for Versatile Image Translation and Manipulation [pdf] [code]
- a I2I framework using a bi-level feature alignment strategy that adopts a top-k operation to rank block-wise features and dense attention between block features to reduce memory cost.

CVPR 2022

Exploring Patch-Wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks [pdf] [code]
- a I2I framework based on semantic relation consistency and regularization along with the decoupled contrastive learning
Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint [pdf] [code]
- a Structure Consistency Constraint that reduces the randomness of color transformation in I2I.
A Style-Aware Discriminator for Controllable Image Translation [pdf] [code]
- a style-aware discriminator that acts as both the critic and the style encoder to provide conditions for the generator in I2I.
Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation [pdf] [NO CODE]
- a I2I method based on high frequency bands distillation from discrete wavelet transformation.
InstaFormer: Instance-Aware Image-to-Image Translation With Transformer [pdf] [code]
- a transformer-based architecture with with adaptive instance normalization for instance-aware I2I.
Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation [pdf] [code]
- a universal regularization technique for I2I called maximum spatial perturbation consistency which enforces the spatial perturbation function and translation operator to be commutative.
FlexIT: Towards Flexible Semantic Image Translation [pdf] [code]
- a semantic image translation method based on autoencoder latent space and multi-modal embedding space
Self-Supervised Dense Consistency Regularization for Image-to-Image Translation [pdf] [NO CODE]
- an auxiliary self-supervision loss with dense consistency regularization for I2I.
Unsupervised Image-to-Image Translation With Generative Prior [pdf] [code]
- a I2I framework that uses the generative prior from GANs to learn rich content correspondences across various domains
QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation [pdf] [code]
- a I2I framework based on a query-selected attention module, which compares feature distances in the source domain and select queries acc. to the measurement of signficance.

AAAI 2022

OA-FSUI2IT: A Novel Few-Shot Cross Domain Object Detection Framework with Object-Aware Few-Shot Unsupervised Image-to-Image Translation [pdf] [NO CODE]
- an Object-Aware Few-Shot Image Translation framework for few-shot cross domain object detection
Style-Guided and Disentangled Representation for Robust Image-to-Image Translation [pdf] [NO CODE]
- a I2I framework with a style-guided disriminator using flexible decision boundary and independent domain attributes

Surveys

Image-to-image translation: Methods and applications. 2021 [pdf]
- lightweight network design for better efficiency
- generalize to cross-modality tasks (e.g., NLP, speech)
Deep Generative Adversarial Networks for Image-to-Image Translation: A Review paper. 2020 [pdf]
- Solve mode collapse
- More realistic evaluation metrics
- More image diversity
- Deep Reinforcement Learning
- 3D image-to-image translation
- 3D datasets
- Cybersecurity applications
An Overview of Image-to-Image Translation Using Generative Adversarial Networks [pdf]
- Combine GAN with other methods (e.g., VAE) to stabilize training
- GAN compression for lightweight design
- Transfer other methods (e.g., SR, Attention, OT) to I2IT
- Remove unnecessary components
- Extend to video
Unsupervised Image-to-Image Translation: A Review [pdf]
Applications of I2I to rainy days
- Domain Bridge for Unpaired Image-to-Image Translation and Unsupervised Domain Adaptation [pdf]
- Closing the Loop: Joint Rain Generation and Removal via Disentangled Image Translation [pdf]
- From Rain Generation to Rain Removal [pdf]
- DerainCycleGAN: Rain Attentive CycleGAN for Single Image Deraining and Rainmaking [pdf]
- Close the Loop: A Unified Bottom-up and Top-down Paradigm for Joint Image Deraining and Segmentation [pdf]

Datasets

Metrics

Amazon Mechanical Turk (AMT)
Peak Signal-to-Noise Ratio (PSNR) ↑
Structural Similarity Index Measure (SSIM) ↑
Inception Score (IS) ↑
Fréchet Inception Distance (FID) ↓
Kernel Inception Distance (KID) ↓
Perceptual Distance (PD) ↓
Learned Perceptual Image Patch Similarity (LPIPS) ↓
FCN ↑
Density and Coverage (DC) ↑

ShenZheng2000/Awesome-Image-to-Image-Translation

Papers

WACV 2023

ICLR 2023

AAAI 2023

NeurIPS 2022

ECCV 2022

CVPR 2022

AAAI 2022

Surveys

Datasets

Metrics

Resources