
Curated list of awesome resources for the Stable Diffusion AI Model.

License: Mozilla Public License 2.0 (MPL-2.0)

Awesome Stable-Diffusion

This is a list of software and resources for the Stable Diffusion AI model.

  • ๐Ÿ–Š๏ธ marks content that requires sign-up or account creation for a third party service outside GitHub.
  • ๐Ÿ’ต marks Non-Free content: commercial content that may require any kind of payment.

Due to the fast-moving nature of the topic, entries in the list may be removed at an expedited rate until the ecosystem matures.

See Contributing.

Official Resources

  • CompVis/Stable Diffusion - The official release of Stable Diffusion including a CLI, an AI-based Safety Classifier, which detects and suppresses sexualized content, and all the necessary files to get running.
  • stability-AI/stability-sdk - The official SDK for building Python applications that use StabilityAI's cloud platform instead of hosting the model locally. Operation requires an API key (🖊️💵).
  • Public Release Announcement - StabilityAI's announcement about the public release of Stable Diffusion.
  • ๐Ÿ–Š๏ธ Official Discord - The official Stable Diffusion Discord by StabilityAI.
  • laion-aesthetic - The dataset used train stable diffusion, useful for querying to see if a concept is represented.

Actively Maintained Forks and Containers

All forks listed here add features and optimisations and are generally faster than the original release, as they keep the model in memory rather than reloading it after every prompt. Most forks seem to remove the Safety Classifier, which may present a risk if used to provide public-facing services such as Discord bots.

Models and Weights

Models (.ckpt files) must be separately downloaded and are required to run Stable Diffusion. The latest model release is v1.4.

  • ๐Ÿ–Š๏ธ Official Model Card - Official Model Card on Hugging Face with all versions of the model. Download requires sign-in and acceptance of terms of service.
  • stable-diffusion-v-1-4-original.chkpt - The latest model's card
  • RealESRGAN Models - Download location for the latest RealESRGAN models required to use the upscaling features implemented by many forks. Different models exist for realistic and anime content. Please refer to the fork documentation to identify the ones you need.
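Because model files are downloaded separately, it can help to confirm that a checkpoint is actually in place before launching a fork. The following is a minimal sketch using only the Python standard library; the function names and the `models/model.ckpt` path are hypothetical, and any expected checksum must come from the download source, since none is given here.

```python
import hashlib
from pathlib import Path
from typing import Optional


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so a multi-gigabyte checkpoint is never fully in RAM."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_checkpoint(path: Path, expected_sha256: Optional[str] = None) -> bool:
    """Return True if the file exists, is non-empty, and matches the checksum if one is given."""
    if not path.is_file() or path.stat().st_size == 0:
        return False
    if expected_sha256 is None:
        return True
    return sha256_of(path) == expected_sha256.lower()


ckpt = Path("models/model.ckpt")  # hypothetical location; adjust to your fork's layout
print("checkpoint found" if check_checkpoint(ckpt) else "checkpoint missing or empty")
```

Each fork documents where it expects the file to live, so the path above should be replaced accordingly.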

Online Demos and Notebooks

Complementary Models and Tools

Tools and models for use in conjunction with Stable Diffusion.

Img2Img

  • Prompt to Prompt - Unofficial Implementation of Cross-attention-control for prompt to prompt image editing.

Customisation

GUIs

Upscaling

  • RealESRGAN - ESRGAN Upscaling (2x, 4x) and content restoration. Python: pip install realesrgan
  • Cupscale - GUI for running various upscaling models, including ESRGAN and RealESRGAN.
  • BasicSR - Open-source upscaling and restoration toolbox supporting several models.
  • BSRGAN - Another upscaling solution, specialized in upscaling degraded images.

Content Restoration

  • lama-cleaner - Content-aware AI inpainting tool useful for removing unwanted objects or defects from images. Python: pip install lama-cleaner
  • GFPGAN - Face Restoration GAN included in several forks for automatically fixing the face deformation commonly found in SD output.
  • CodeFormer - Another Face Restoration model (Paper).

Task Chaining

  • chaiNNer - Graphical node-based editor for chaining image processing tasks.
  • ai-art-generator - AI art generation suite combining Stable Diffusion and other models for high volume art generation.

Prompt Building

Prompts are the instructions given to diffusion models to manipulate their output.

  • PromptoMania - A visual prompt construction tool.
  • Lexica.art - A searchable, visual database of images and the prompts and settings used to create them.
  • 🖊️ Phraser - A visual prompt builder drawing on a database of examples. (Requires account creation)
  • ai-art.com/modifiers - A visual reference guide for keywords.
  • pharmapsychotic/clip-interrogator - Jupyter notebook that uses CLIP models to suggest a prompt for generating images similar to a given image (Demo).
  • rom1504/clip-retrieval - Searches for prompt keywords in the datasets used in training Stable Diffusion and other models (Online GUI).
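As a rough illustration of what the prompt builders above produce, the sketch below assembles a prompt from a subject and a list of modifier keywords. It assumes the common convention of comma-separated keywords, which is a habit rather than a requirement of the model; `build_prompt` is a hypothetical helper, not part of any listed tool.

```python
from typing import Iterable


def build_prompt(subject: str, modifiers: Iterable[str]) -> str:
    """Join a subject and modifiers into one prompt, skipping blanks and duplicates."""
    parts = [subject.strip()]
    seen = {parts[0].lower()}
    for modifier in modifiers:
        cleaned = modifier.strip()
        if cleaned and cleaned.lower() not in seen:
            seen.add(cleaned.lower())
            parts.append(cleaned)
    return ", ".join(parts)


prompt = build_prompt(
    "a lighthouse on a cliff",
    ["oil painting", "dramatic lighting", " ", "Oil Painting"],
)
print(prompt)  # a lighthouse on a cliff, oil painting, dramatic lighting
```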

Tutorials and Comparisons

Tutorials and high-quality educational resources.

Getting Up and Running

Learning and Mastering

Studies

Extending Functionality

Community Resources

Social Media

Online Services implementing Stable Diffusion

  • ๐Ÿ–Š๏ธ๐Ÿ’ต Dream Studio - Online art generation service by StabilityAI, the creators of Stable Diffusion. Similar to services like DALL-E or Midjourney, this operates on a credit model with a free allowance of credits given to signed up users on a monthly basis.
  • ๐Ÿ–Š๏ธ๐Ÿ’ต dream.ai - Online art generation service by Wombo.ai (mobile apps available).