This plugin feeds your editor viewport through the Diffusers library to create AI synthesized imagery.
Join us on Discord.
- Unreal Engine 5.0
- CUDA compatible GPU (Nvidia)
- Minimum 6GB Vram (to run Unreal and fp16 Stable Diffusion simultaneously)
Clone this repo and copy the StableDiffusionTools
folder to YourProjectFolder/Plugins/
or download the latest release from the releases page.
After installing the plugin, activate it through Windows->Plugins
. To open the main tool window for the plugin, go to Windows->StableDiffusionTools
.
The first time you use the plugin you will need to install the required python dependencies using the Update/Install dependencies
button in the Dependencies section of the main plugin window or through the Stable Diffusion Dependencies Installer
window. You may need to restart the Unreal Editor the first time you install dependencies.
This plugin provides three different generator backends out of the box. Local generation using the Diffusers library, remote generation using the Stability.AI SDK and Dream Studio and remote generation using Stable Horde. Choose a backend in Project Settings->Stable Diffusion Tools->Generator Type
and set a token in the Generator Tokens
map for your chosen generator.
To generate images using this plugin, you will first need a model. A few model presets are provided with the plugin and you can create your own by creating a StableDiffusionModelAsset
or you can just enter the model options in the plugin window directly.
To download a model from huggingface.co, you will need an account and a https://huggingface.co/settings/tokens with read permissions.
You can use any diffusers based model from the diffusers category on huggingface.co. To choose a model, enter it in the format Username/Modelname
. To use any of the default models RunwayML Stable Diffusion 1.5, Runway inpaiting, or CompViz Stable Diffusion 1.4, you will need to vist the respective model card page for each of these models and accept their usage agreement before downloading their weights otherwise you will receive an HTTP401 error.
The revision
property will allow you to pick a specific branch of the model to download. I recommend the fp16
branch (if the model provides it) in case you have a limited amount of VRAM (under 6GB). You can also choose the level of precision you want to model to use where 16-bit floating point is the default.
If you set the Covolution padding
model option to circular
, then generated images will have borders that will wrap around to the opposite side of the image. When combined with prompts such as a tiling texture of INSERT_SUBJECT_HERE
this will generate textures that are appropriate for mapping onto flat surfaces or landscapes.
Use the generation section to tweak your prompt, iteration count, viewport influence strength and the seed. If you are new to prompt authoring then I recommend reading this guide.
After you click the Generate image
button, the in-progress image will display in the plugin window and log its progress in the editor's Output Log
panel.
The Image outputs section contains options for saving a generated image to either a texture or an external file as well as upsampling options. Enter the destination paths and the name of the image and click the Save
button for the asset type of your choice, or upsample the output 4x first using Real-ESRGAN.
upsampled_AndroidChairs2.0001_2.mov
The plugin will add two new types of tracks to the Sequencer which can be added using the +Track
button, a Stable Diffusion Options
track and Stable Diffusion Prompt
tracks.
To create an animation, create a single Options track with a section that will span the length of your animation. You can right-click the section and go to properties to modify parameters that will stay consistent over the course of the animation such as the model options and output frame size.
The options track has 3 animatable channels, Iterations, Seed and Strength. All of these can be keyframed across the length of your animation.
To add prompts, you can create multiple prompt tracks containing multiple prompt sections. To set the prompt text, right-click on a prompt section, go to properties, and set the prompt text as well as the sentiment. A positive sentiment will tell Stable Diffusion that this prompt is something we would like to see in our generated output, whilst negative sentiments will instruct it to exclude unwanted prompts from the output.
Prompt tracks have a single animatable parameters, Weight
. Increasing or decreasing this value will increase or decrease the influence of the prompt in your animation.
To render your animation, click the Render movie
button to open the movie pipeline panel for your sequence. Add a SDMoviePipelineConfig
config asset to instruct the pipeline to export a Stable Diffusion image sequence. Rendered frames by default will show up in YourProjectFolder/Saved/MovieRenders
.
Note: Currently the Stable Diffusion Movie Pipeline will only export floating point images so I recommend sticking to OpenEXR as the output file type.
Click Render (local)
to render your stable diffusion animation. If you haven't already initialised a model, then the pipeline will use the model options set in your Options track to initialise a model at the start of the render which may cause a white screen.
Inpainting lets you fill only a masked portion of your input image whilst keeping areas outside the mask consistent. To use, load the Runway1-5_Inpaint
model preset or choose any other model and make sure that Enable inpainting
is set in the model options. To create a mask, add actors to an actor layer and then set the Inpaint layer
field in the UI to the actor layer you want to use as a mask for inpainting. If using the sequencer, you will need to set the inpaint layer in the properties of an options track and enable Render CustomDepth Pass
and set CustomDepth Stencil Value
to 1 for each actor on the layer.
IMPORTANT: When generating images from the UI panel, the input image will be too dark. This is due to the plugin now using a SceneCapture2D actor to capture the viewport to enable inpainting stencil support.
To fix this change the following change the following options on a global post processing volume:
- Metering mode = Manual
- Exposure Compensation = 15
- Fix texture memory usage.
- Panning the generated image only works when grabbing it from the upper right quarter.
- You will have to restart the editor after installing dependencies.
- Serialisable fields for the plugin UI.
- Re-use the last generated image in the viewport for img2img and instructpix2pix.
- Previz support for the sequencer using frame holds.
- Generate multiple image iterations in the viewport.