text-to-image-generation

There are 152 repositories under text-to-image-generation topic.

  • NVlabs/Sana

    SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

    Language:Python4.7k76243308
  • Lightricks/ComfyUI-LTXVideo

    LTX-Video Support for ComfyUI

    Language:Python2.4k23236231
  • adobe-research/custom-diffusion

    Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

    Language:Python2k2994140
  • FoundationVision/Infinity

    [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    Language:Python1.5k2411981
  • muzishen/IMAGDressing

    [AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.

    Language:Python1.3k1661116
  • AIDC-AI/Awesome-Unified-Multimodal-Models

    Awesome Unified Multimodal Models

    86526
  • songweige/rich-text-to-image

    Rich-Text-to-Image Generation

    Language:Python799191868
  • PKU-YuanGroup/UniWorld

    UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

    Language:Python788106924
  • Liquid

    FoundationVision/Liquid

    (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

    Language:Python626192333
  • markfulton/NanoBananaEditor

    The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, version history, and more. Powered by Gemini 2.5 Flash images API.

    Language:TypeScript46534103
  • donahowe/AutoStudio

    AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

    Language:Jupyter Notebook447104731
  • Paranioar/Awesome_Matching_Pretraining_Transfering

    The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

  • ByteVisionLab/TokenFlow

    [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

    Language:Python3986275
  • OSU-NLP-Group/MagicBrush

    [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

    Language:Python38152614
  • woctezuma/stable-diffusion-colab

    Colab notebook for Stable Diffusion Hyper-SDXL.

    Language:Jupyter Notebook32831081
  • RockeyCoss/SPO

    [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

    Language:Python25552810
  • CFGpp-diffusion/CFGpp

    Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)

    Language:Python2307126
  • huggingface/diffusion-fast

    Faster generation with text-to-image diffusion models.

    Language:Python2296415
  • yunqing-me/AttackVLM

    [NeurIPS-2023] Annual Conference on Neural Information Processing Systems

    Language:Python21912516
  • tsunghan-wu/SLD

    🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)

    Language:Python1833610
  • GuoLanqing/Awesome-High-Resolution-Diffusion

    🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.

  • ExplainableML/ReNO

    [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

    Language:Python15651514
  • zituitui/BELM

    [NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models".

    Language:Python1362138
  • somepago/DCR

    Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.

    Language:Python1133125
  • yandex-research/swd

    Scale-wise Distillation of Diffusion Models

  • louisYen/Gen4Gen

    🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"

    Language:Python108954
  • QY-H00/attention-interpolation-diffusion

    [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion

    Language:Jupyter Notebook107215
  • Correr-Zhou/MagicTailor

    [IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".

    Language:Python99843
  • j-min/DSG

    Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

    Language:Jupyter Notebook98385
  • CSU-JPG/TextAtlas

    A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation

    Language:Python84440
  • mapo-t2i/mapo

    Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).

    Language:Python82149
  • YonghaoXu/Txt2Img-MHN

    [IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks

    Language:Python781117
  • glami/glami-1m

    The largest multilingual image-text classification dataset. It contains fashion products.

    Language:Jupyter Notebook75627
  • haoosz/ConceptExpress

    [ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

    Language:Python74368
  • PangzeCheung/SingDiffusion

    [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models

    Language:Python71324
  • YangLing0818/ContextDiff

    [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation

    Language:Python70484