OneDiffusion

HuggingFace Homepage arXiv

Teaser Image

Introduction

This is official repo of OneDiffusion, a versatile, large-scale diffusion model that seamlessly supports bidirectional image synthesis and understanding across diverse tasks. We will release the code and checkpoints in early December.

Qualitative Results

1. Text-to-Image

Text-to-Image results

2. ID customization

ID customization

ID customization non-human subject

3. Multiview generation

Single image to multiview:

Image to multiview

image to multiview

Text to multiview:

Text to multiview image

4. Condition-to-Image and vice versa

Condition and Image

Citation

@misc{le2024diffusiongenerate,
      title={One Diffusion to Generate Them All}, 
      author={Duong H. Le and Tuan Pham and Sangho Lee and Christopher Clark and Aniruddha Kembhavi and Stephan Mandt and Ranjay Krishna and Jiasen Lu},
      year={2024},
      eprint={2411.16318},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2411.16318}, 
}

Acknowledgements