/SLD

🔥 Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"

MIT LicenseMIT

Self-correcting LLM-controlled Diffusion Models

arXiv

Authors: Tsung-Han Wu*, Long Lian*, Joseph E. Gonzalez, Boyi Li†, Trevor Darrell† at UC Berkeley.

TL;DR: The Self-correcting LLM-controlled Diffusion (SLD) Framework features:

  1. Self-correction: Enhances generative models with LLM-integrated detectors for precise text-to-image alignment.
  2. Unified Generation and Editing: Excels at both image generation and fine-grained editing.
  3. Universal Compatibility: Works with ANY image generator, like DALL-E 3, requiring no extra training or data.

SLD Framework

We will release our code soon!