[Guide] Significally increase speed with sd 2.1 turbo
light-and-ray opened this issue · 7 comments
In webui was added support of sd_turbo. It's generally turbo version of sd 2.1 512. I've tested, and it works nice with StableSR. You need to:
- update webui to 1.7.0 version
- download sd_turbo model
- (upd webui 1.8 has native lcm sampler support) install lcm sampler or use it from animate-diff extension, if you have it installed
- install StableSR 512 model
- decrease sampling steps to 4
- use Stable SR stablesr_webui_sd-v2-1-512-ema-000117 with sd_turbo
The 768 model should also work? I guess.
The 768 model should also work? I guess.
There's no sd turbo for the 768 model. Stablesr768 + sd 2.1 512 doesn't work
Also I've noticed higher number of steps produces more blurred image. So 4 steps not only the fastest, it also has the best quality. 3 and less produce a lot of artifacts
Also I've noticed higher number of steps produces more blurred image. So 4 steps not only the fastest, it also has the best quality. 3 and less produce a lot of artifacts
I guess it is because sd turbo directly distilled on 4 steps. I will consider finetuning current model on the sd turbo if I have time later :)Thx for your help.
Also I've noticed higher number of steps produces more blurred image. So 4 steps not only the fastest, it also has the best quality. 3 and less produce a lot of artifacts
I guess it is because sd turbo directly distilled on 4 steps. I will consider finetuning current model on the sd turbo if I have time later :)Thx for your help.
First of all, I love all the work you've done on this. The results are brilliant and I've used StableSR successfully for months now.
I was curious if there's still a plan to eventually release an SDXL model? Or did that idea get abandoned? Perhaps due to time or feasibility?
Hi. Thx for your interest and glad to hear that.
We are interested in developing a XL version. But I am currently struggling with my PhD projects and may not be available for that.
Heads up that an SD2.1 model (LoRA) with similar low step inference was recently released here: https://huggingface.co/h1t/TCD-SD21-base-LoRA
They use a method called Trajectory Consistency Distillation (TCD) as explained in their paper here: https://arxiv.org/abs/2402.19159
Any thoughts on whether or not this would work with StableSR similar to how Turbo has? If so it's worth checking to see if their model was trained to also work w/ the 768x768 resolution.
If anyone gives it a go before I get a chance please report your findings!