Does it support the Whisper model?
Closed this issue · 2 comments
kadirnar commented
Does it support the Whisper model?
SidaZh commented
In theory, EETQ supports every model that transformers supports; you can try this:
```python
from transformers import AutoModelForCausalLM, EetqConfig

path = "/path_to_model"

# int8 weight-only quantization via the EETQ backend
quantization_config = EetqConfig("int8")

model = AutoModelForCausalLM.from_pretrained(
    path,
    device_map="auto",
    quantization_config=quantization_config,
)
```
kadirnar commented
I tested it and it works. What can I do to optimize it further? Have you tested it with `torch.compile`?