text-to-image-generation

There are 118 repositories under the text-to-image-generation topic.

  • NVlabs/Sana

    SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

    Language:Python4.5k75228296
  • Lightricks/ComfyUI-LTXVideo

    LTX-Video Support for ComfyUI

    Language:Python2.3k8136217
  • adobe-research/custom-diffusion

    Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

    Language:Python2k3194140
  • FoundationVision/Infinity

    [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    Language:Python1.4k2411778
  • muzishen/IMAGDressing

    [AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.

    Language:Python1.3k1456113
  • songweige/rich-text-to-image

    Rich-Text-to-Image Generation

    Language:Python801191868
  • FoundationVision/Liquid

    Liquid: Language Models are Scalable and Unified Multi-modal Generators

    Language:Python6142434
  • donahowe/AutoStudio

    AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

    Language:Jupyter Notebook438104733
  • Paranioar/Awesome_Matching_Pretraining_Transfering

    The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

  • OSU-NLP-Group/MagicBrush

    [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

    Language:Python37652615
  • woctezuma/stable-diffusion-colab

    Colab notebook for Stable Diffusion Hyper-SDXL.

    Language:Jupyter Notebook32641081
  • byteflow-ai/TokenFlow

    [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

    Language:Python3056271
  • huggingface/diffusion-fast

    Faster generation with text-to-image diffusion models.

    Language:Python2266415
  • CFGpp-diffusion/CFGpp

    Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR 2025)

    Language:Python2247126
  • RockeyCoss/SPO

    [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

    Language:Python1975256
  • yunqing-me/AttackVLM

    [NeurIPS-2023] Annual Conference on Neural Information Processing Systems

    Language:Python18712414
  • tsunghan-wu/SLD

    🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models" (SLD)

    Language:Python182358
  • GuoLanqing/Awesome-High-Resolution-Diffusion

    🔥🔥🔥 A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.

  • ExplainableML/ReNO

    [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

    Language:Python13451111
  • zituitui/BELM

    [NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models".

    Language:Python1232127
  • louisYen/Gen4Gen

    🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"

    Language:Python108955
  • somepago/DCR

    Official PyTorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.

    Language:Python1053125
  • QY-H00/attention-interpolation-diffusion

    [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion

    Language:Jupyter Notebook97214
  • j-min/DSG

    Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

    Language:Jupyter Notebook86385
  • yandex-research/swd

    Scale-wise Distillation of Diffusion Models

  • Correr-Zhou/MagicTailor

    [arXiv 2024] Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".

    Language:Python84843
  • CSU-JPG/TextAtlas

    A large-scale dataset for training and evaluating a model's ability at dense text image generation

    Language:Python78430
  • YonghaoXu/Txt2Img-MHN

    [IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks

    Language:Python751117
  • glami/glami-1m

    The largest multilingual image-text classification dataset. It contains fashion products.

    Language:Jupyter Notebook73627
  • mapo-t2i/mapo

    Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).

    Language:Python71148
  • YangLing0818/ContextDiff

    [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation

    Language:Python70484
  • humansensinglab/ITI-GEN

    [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation

    Language:Python680711
  • PangzeCheung/SingDiffusion

    [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models

    Language:Python66324
  • haoosz/ConceptExpress

    [ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

    Language:Python65358
  • j-min/VPGen

    Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)

    Language:Jupyter Notebook56133
  • HotpotDesign/api-examples

    This repository illustrates how to use the Hotpot.ai API. Our API provides Stable Diffusion, image generator, text-to-image generator, background removal, image upscaler, photo restoration, and picture colorization.