text-to-image-generation
There are 77 repositories under the text-to-image-generation topic.
adobe-research/custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
songweige/rich-text-to-image
Rich-Text-to-Image Generation
donahowe/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
woctezuma/stable-diffusion-colab
Colab notebook for Stable Diffusion Hyper-SDXL.
OSU-NLP-Group/MagicBrush
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
huggingface/diffusion-fast
Faster generation with text-to-image diffusion models.
yunqing-me/AttackVLM
[NeurIPS-2023] Annual Conference on Neural Information Processing Systems
tsunghan-wu/SLD
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"
CFGpp-diffusion/CFGpp
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models"
RockeyCoss/SPO
Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
somepago/DCR
Official PyTorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
louisYen/Gen4Gen
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
GuoLanqing/Awesome-High-Resolution-Diffusion
🔥🔥🔥 A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
QY-H00/attention-interpolation-diffusion
Interpolation Between Text-to-Image Generation!
j-min/DSG
Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)
ExplainableML/ReNO
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
glami/glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
humansensinglab/ITI-GEN
[ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation
PangzeCheung/SingDiffusion
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
YangLing0818/ContextDiff
[ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation
YonghaoXu/Txt2Img-MHN
[IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
mapo-t2i/mapo
Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).
HotpotDesign/api-examples
This repository illustrates how to use the Hotpot.ai API. Our API provides Stable Diffusion, image generator, text-to-image generator, background removal, image upscaler, photo restoration, and picture colorization.
j-min/VPGen
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
haoosz/ConceptExpress
[ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Nithin-GK/UniteandConquer
[CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models
iSEE-Laboratory/DreamView
(ECCV 2024) Official implementation of the paper "DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation"
AlonzoLeeeooo/LCDG
The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".
Shentao-YANG/Dense_Reward_T2I
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
1jsingh/Divide-Evaluate-and-Refine
Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
WebRevo/TEXT_ARTIFY
"Experience the magic of the 'Text to Image' project, where JavaScript transforms your text into captivating visuals using HTML5 and CSS3. Unlock the creative potential of digital storytelling and data visualization in a visually immersive experience."
Mamadou-Keita/VLM-DETECT
[ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection
LukasStruppek/Exploiting-Cultural-Biases-via-Homoglyphs
[Journal of Artificial Intelligence Research] Source code for our paper "Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis".
pyladiesams/personalization-with-text-to-image-diffusion-models-feb2024
Get familiar with different fine-tuning techniques for text-to-image models, and learn how to teach a diffusion model a concept of your choosing
zeyofu/Commonsense-T2I
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
kanugurajesh/Student-LMS
An application to make learning as fun as gaming