text-to-image-generation

There are 91 repositories under the text-to-image-generation topic.

  • adobe-research/custom-diffusion

    Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

    Language:Python1.9k3394139
  • muzishen/IMAGDressing

    [AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high fidelity and garment consistency for virtual dressing.

    Language:Python1.1k144391
  • songweige/rich-text-to-image

    Rich-Text-to-Image Generation

    Language:Python769201665
  • Lightricks/ComfyUI-LTXVideo

    LTX-Video Support for ComfyUI

    Language:Python50267530
  • donahowe/AutoStudio

    AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

    Language:Jupyter Notebook419104731
  • woctezuma/stable-diffusion-colab

    Colab notebook for Stable Diffusion Hyper-SDXL.

    Language:Jupyter Notebook32061082
  • OSU-NLP-Group/MagicBrush

    [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

    Language:Python31842114
  • huggingface/diffusion-fast

    Faster generation with text-to-image diffusion models.

    Language:Python2024414
  • CFGpp-diffusion/CFGpp

    Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models"

    Language:Python172795
  • yunqing-me/AttackVLM

    [NeurIPS-2023] Annual Conference on Neural Information Processing Systems

    Language:Python1662228
  • RockeyCoss/SPO

    Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

    Language:Python1637224
  • tsunghan-wu/SLD

    🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"

    Language:Python159358
  • GuoLanqing/Awesome-High-Resolution-Diffusion

    🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.

  • ExplainableML/ReNO

    [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

    Language:Python109589
  • somepago/DCR

    Official PyTorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.

    Language:Python1054125
  • louisYen/Gen4Gen

    🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"

    Language:Python103955
  • zituitui/BELM

    [NeurIPS 2024] Official implementation of "BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models".

    Language:Python962104
  • QY-H00/attention-interpolation-diffusion

    [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusion

    Language:Jupyter Notebook88311
  • j-min/DSG

    Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

    Language:Jupyter Notebook79385
  • Correr-Zhou/MagicTailor

    Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".

    Language:Python74833
  • glami/glami-1m

    The largest multilingual image-text classification dataset. It contains fashion products.

    Language:Jupyter Notebook69726
  • humansensinglab/ITI-GEN

    [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation

    Language:Python650611
  • PangzeCheung/SingDiffusion

    [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models

    Language:Python64413
  • YonghaoXu/Txt2Img-MHN

    [IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks

    Language:Python641106
  • mapo-t2i/mapo

    Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).

    Language:Python62237
  • YangLing0818/ContextDiff

    [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation

    Language:Python62583
  • HotpotDesign/api-examples

    This repository illustrates how to use the Hotpot.ai API. The API provides Stable Diffusion, image generation, text-to-image generation, background removal, image upscaling, photo restoration, and picture colorization.

  • j-min/VPGen

    Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)

    Language:Jupyter Notebook53233
  • haoosz/ConceptExpress

    [ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

    Language:Python46448
  • LayoutLLM-T2I/LayoutLLM-T2I

    Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation

    Language:Python413100
  • iSEE-Laboratory/DreamView

    (ECCV 2024) Official implementation of the paper "DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation"

    Language:Python36441
  • Nithin-GK/UniteandConquer

    [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models

    Language:Python35543
  • AlonzoLeeeooo/LCDG

    The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".

    Language:Python33343
  • Shentao-YANG/Dense_Reward_T2I

    Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).

    Language:Python33310
  • 1jsingh/Divide-Evaluate-and-Refine

    Repo for our NeurIPS 2023 paper "Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback"

    Language:Jupyter Notebook25151
  • Mamadou-Keita/VLM-DETECT

    [ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection

    Language:Python19232