Does have a way to load the model only once and use it for multi audio file in the same time?
Closed this issue · 2 comments
farmer00317558 commented
I want to process multi audio files in the same time instead process them one by one.
I tried making the WhisperContext as a Singleton, but when got an error:
Assertion failed: (sum > 0.0f), function ggml_compute_forward_flash_attn_f16, file ggml.c, line 6299.
Maybe it's a problem of whisper.cpp.
tazz4843 commented
whisper.cpp does not support this, you must load a new model for each thread.
tazz4843 commented
whisper.cpp version 1.3.0 was released two days ago, and should support this now. I'll be working on adding the features into whisper-rs tonight.