StoryBoardSD: A Python repository from Story Squad

This is Story Squads takeover and modification of the stable diffusion webui. We are using this to generate videos for our products and marketing efforts.

we are currently working on extracting the video generation code from the webui and making it into a standalone API.

Requirements

Hardware

A computer with a GPU that supports CUDA 11.1 or higher.
16GB of RAM or more.
~20GB of free disk space.
8GB of free VRAM.
A heart.
A soul.
A sense of adventure.
A sense of wonder.
A sense of humor.

Software

Windows 10, macOS 10.15, or Linux.

There are wonderful automatic install scripts for Windows and Linux. If you're on macOS, you'll have to follow the instructions

Automatic Installation on Windows

Install Python 3.10.6, checking "Add Python to PATH"
Install git.
Download the stable-diffusion-webui repository, for example by running git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git.
Place model.ckpt in the models directory (see dependencies for where to get it).
Run webui-user.bat from Windows Explorer as normal, non-administrator, user.

Automatic Installation on Linux

Install the dependencies:

# Debian-based:
sudo apt install wget git python3 python3-venv
# Red Hat-based:
sudo dnf install wget git python3
# Arch-based:
sudo pacman -S wget git python3

To install in /home/$(whoami)/stable-diffusion-webui/, run:

bash <(wget -qO- https://raw.githubusercontent.com/AUTOMATIC1111/stable-diffusion-webui/master/webui.sh)

Installation on Apple Silicon

Find the instructions here.

Credits

Stable Diffusion - https://github.com/CompVis/stable-diffusion, https://github.com/CompVis/taming-transformers
k-diffusion - https://github.com/crowsonkb/k-diffusion.git
GFPGAN - https://github.com/TencentARC/GFPGAN.git
CodeFormer - https://github.com/sczhou/CodeFormer
ESRGAN - https://github.com/xinntao/ESRGAN
SwinIR - https://github.com/JingyunLiang/SwinIR
Swin2SR - https://github.com/mv-lab/swin2sr
LDSR - https://github.com/Hafiidz/latent-diffusion
Ideas for optimizations - https://github.com/basujindal/stable-diffusion
Doggettx - Cross Attention layer optimization - https://github.com/Doggettx/stable-diffusion, original idea for prompt editing.
InvokeAI, lstein - Cross Attention layer optimization - https://github.com/invoke-ai/InvokeAI (originally http://github.com/lstein/stable-diffusion)
Rinon Gal - Textual Inversion - https://github.com/rinongal/textual_inversion (we're not using his code, but we are using his ideas).
Idea for SD upscale - https://github.com/jquesnelle/txt2imghd
Noise generation for outpainting mk2 - https://github.com/parlance-zz/g-diffuser-bot
CLIP interrogator idea and borrowing some code - https://github.com/pharmapsychotic/clip-interrogator
Idea for Composable Diffusion - https://github.com/energy-based-model/Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch
xformers - https://github.com/facebookresearch/xformers
DeepDanbooru - interrogator for anime diffusers https://github.com/KichangKim/DeepDanbooru
Initial Gradio script - posted on 4chan by an Anonymous user. Thank you Anonymous user.
StoryBoard - Tasha Upchurch Via StorySquad