text-to-image-generation

There are 152 repositories under text-to-image-generation topic.

NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Language:Python4.7k 76 243308
Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language:Python2.4k 23 236231
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Language:Python2k 29 94140
FoundationVision/Infinity
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Language:Python1.5k 24 11981
muzishen/IMAGDressing
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.
Language:Python1.3k 16 61116
AIDC-AI/Awesome-Unified-Multimodal-Models
Awesome Unified Multimodal Models
86526
songweige/rich-text-to-image
Rich-Text-to-Image Generation
Language:Python799 19 1868
PKU-YuanGroup/UniWorld
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
Language:Python788 10 6924
FoundationVision/Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Language:Python626 19 2333
markfulton/NanoBananaEditor
The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, version history, and more. Powered by Gemini 2.5 Flash images API.
Language:TypeScript465 3 4103
donahowe/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Language:Jupyter Notebook447 10 4731
Paranioar/Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.
433 12 549
ByteVisionLab/TokenFlow
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
Language:Python398 6 275
OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Language:Python381 5 2614
woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
Language:Jupyter Notebook328 3 1081
RockeyCoss/SPO
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Language:Python255 5 2810
CFGpp-diffusion/CFGpp
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)
Language:Python230 7 126
huggingface/diffusion-fast
Faster generation with text-to-image diffusion models.
Language:Python229 6 415
yunqing-me/AttackVLM
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
Language:Python219 1 2516
tsunghan-wu/SLD
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)
Language:Python183 3 610
GuoLanqing/Awesome-High-Resolution-Diffusion
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
158 9 37
ExplainableML/ReNO
[NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Language:Python156 5 1514
zituitui/BELM
[NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models".
Language:Python136 2 138
somepago/DCR
Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
Language:Python113 3 125
yandex-research/swd
Scale-wise Distillation of Diffusion Models
1123
louisYen/Gen4Gen
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
Language:Python108 9 54
QY-H00/attention-interpolation-diffusion
[NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion
Language:Jupyter Notebook107 2 15
Correr-Zhou/MagicTailor
[IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".
Language:Python99 8 43
j-min/DSG
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
Language:Jupyter Notebook98 3 85
CSU-JPG/TextAtlas
A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
Language:Python84 4 40
mapo-t2i/mapo
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
Language:Python82 1 49
YonghaoXu/Txt2Img-MHN
[IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
Language:Python78 1 117
glami/glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
Language:Jupyter Notebook75 6 27
haoosz/ConceptExpress
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Language:Python74 3 68
PangzeCheung/SingDiffusion
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
Language:Python71 3 24
YangLing0818/ContextDiff
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
Language:Python70 4 84

text-to-image-generation

NVlabs/Sana

Lightricks/ComfyUI-LTXVideo

adobe-research/custom-diffusion

FoundationVision/Infinity

muzishen/IMAGDressing

AIDC-AI/Awesome-Unified-Multimodal-Models

songweige/rich-text-to-image

PKU-YuanGroup/UniWorld

FoundationVision/Liquid

markfulton/NanoBananaEditor

donahowe/AutoStudio

Paranioar/Awesome_Matching_Pretraining_Transfering

ByteVisionLab/TokenFlow

OSU-NLP-Group/MagicBrush

woctezuma/stable-diffusion-colab

RockeyCoss/SPO

CFGpp-diffusion/CFGpp

huggingface/diffusion-fast

yunqing-me/AttackVLM

tsunghan-wu/SLD

GuoLanqing/Awesome-High-Resolution-Diffusion

ExplainableML/ReNO

zituitui/BELM

somepago/DCR

yandex-research/swd

louisYen/Gen4Gen

QY-H00/attention-interpolation-diffusion

Correr-Zhou/MagicTailor

j-min/DSG

CSU-JPG/TextAtlas

mapo-t2i/mapo

YonghaoXu/Txt2Img-MHN

glami/glami-1m

haoosz/ConceptExpress

PangzeCheung/SingDiffusion

YangLing0818/ContextDiff