city96/ComfyUI-GGUF

How to convert FLUX.1-Depth/Canny/Fill-dev.safetensors to Q8?

ymzlygw opened this issue · 4 comments

Hi, Black Forest Labs just released their strong control models, but they are the same size as FLUX.1-dev at fp16 (24 GB).
Could you explain how to convert FLUX.1-Depth/Canny/Fill-dev.safetensors to Q8, and can you support it? Thanks!

@city96 Could you help me with this? Thanks!

If you search YouTube, there is a video showing a Colab that can do this, but I do not know how to use it.

Can you share the video URL? I don't know what keywords to search for.

There are conversions available on Hugging Face. Here, for example, is a quantized FLUX.1-Fill-dev:
https://huggingface.co/YarvixPA/FLUX.1-Fill-dev-gguf/tree/main
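If you just want a prequantized file, it can be fetched with huggingface-cli; the filename below is an assumption, so check the repository's file list for the exact name:

```shell
# Hypothetical filename -- verify it against the repo's file list first.
huggingface-cli download YarvixPA/FLUX.1-Fill-dev-gguf \
    flux1-fill-dev-Q8_0.gguf --local-dir ComfyUI/models/unet
```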

FLUX.1-Fill-dev.gguf itself is supported by the ComfyUI-GGUF nodes.
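For anyone who still wants to quantize the .safetensors themselves, the ComfyUI-GGUF repository ships conversion tools. A rough sketch of the usual two-step flow, with script names and the output filename taken from the repo's tools documentation at the time of writing (double-check against the current repo):

```shell
# 1. Convert the safetensors checkpoint to a full-precision GGUF.
#    convert.py lives in the tools/ folder of ComfyUI-GGUF; the output
#    name it picks (e.g. *-BF16.gguf) may differ from this sketch.
python tools/convert.py --src FLUX.1-Fill-dev.safetensors

# 2. Quantize the resulting GGUF to Q8_0 using a llama.cpp build patched
#    with the repo's lcpp.patch (an unpatched llama-quantize does not
#    handle image-model tensors).
./llama-quantize FLUX.1-Fill-dev-BF16.gguf FLUX.1-Fill-dev-Q8_0.gguf Q8_0
```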

The problem I have is that FLUX.1 Dev LoRAs are not working with FLUX.1-Fill-dev.gguf. This is the error I get:
ERROR lora diffusion_model.img_in.weight shape '[3072, 384]' is invalid for input of size 196608

Are the LoRAs even supposed to work with FLUX.1-Fill-dev?
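The numbers in the error are consistent with a LoRA that patches FLUX.1-dev's img_in layer being applied to Fill-dev's wider img_in. A minimal sketch of the arithmetic, with the layer sizes inferred from the error message rather than read from the model files:

```python
import numpy as np

# Inferred from the error message:
#   FLUX.1-dev      img_in weight: [3072, 64]  -> 3072 * 64 = 196608 elements
#   FLUX.1-Fill-dev img_in weight: [3072, 384] (extra input channels)
dev_delta = np.zeros(3072 * 64, dtype=np.float32)  # LoRA delta sized for dev

# Applying that delta to Fill-dev's img_in would require a [3072, 384]
# view, which fails with the same kind of shape error as reported:
try:
    dev_delta.reshape(3072, 384)
except ValueError as e:
    print(f"cannot reshape: {e}")
```

So any LoRA that modifies img_in (the turbo/hyper LoRAs apparently do) cannot apply cleanly to Fill-dev, while LoRAs that leave img_in untouched load fine.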

EDIT:

At least these turbo/hyper LoRAs cause the error message:
FLUX.1-Turbo-Alpha
Hyper-FLUX.1-dev-8steps-lora

Other LoRAs I tested do work, but the likeness of faces is much worse with FLUX.1-Fill-dev compared to FLUX.1 Dev.