Is there a way to load the model only once and use it for multiple audio files at the same time?

Question

Closed this issue 2 years ago · 2 comments

I want to process multiple audio files at the same time instead of processing them one by one.

I tried making the WhisperContext a singleton and sharing it across threads, but I got an error:

Assertion failed: (sum > 0.0f), function ggml_compute_forward_flash_attn_f16, file ggml.c, line 6299.

Maybe this is a problem in whisper.cpp itself.

Answer 1 · 2023-04-14T05:05:44.000Z

whisper.cpp does not support this. You must load a new model for each thread.
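A minimal sketch of the one-model-per-thread workaround, using only the Rust standard library. `OwnedModel`, its `load` and `transcribe` methods, and `transcribe_with_private_models` are hypothetical stand-ins for illustration, not whisper-rs API; the point is only that each worker loads and owns its own model, so nothing mutable is shared.

```rust
use std::thread;

// Hypothetical stand-in for a loaded model; NOT a whisper-rs type.
struct OwnedModel {
    path: String,
}

impl OwnedModel {
    // Pretends to read the weights; in a real program this is the expensive step
    // that gets repeated once per thread.
    fn load(path: &str) -> OwnedModel {
        OwnedModel { path: path.to_string() }
    }

    // Takes &mut self to mirror a model whose internal buffers are mutated per run.
    fn transcribe(&mut self, audio: &[f32]) -> String {
        format!("{} handled {} samples", self.path, audio.len())
    }
}

// Spawns one thread per audio buffer; every thread loads a private model.
fn transcribe_with_private_models(model_path: &str, files: Vec<Vec<f32>>) -> Vec<String> {
    let handles: Vec<_> = files
        .into_iter()
        .map(|audio| {
            let path = model_path.to_string();
            thread::spawn(move || {
                let mut model = OwnedModel::load(&path);
                model.transcribe(&audio)
            })
        })
        .collect();

    // Joining in spawn order keeps results aligned with the input files.
    handles
        .into_iter()
        .map(|h| h.join().expect("worker thread panicked"))
        .collect()
}
```

The cost of this approach is that memory use and load time grow with the number of threads, since every worker holds a full copy of the model.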

Answer 2 · 2023-04-17T21:38:07.000Z

whisper.cpp version 1.3.0 was released two days ago and should support this now. I'll be working on adding the feature to whisper-rs tonight.
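For readers wondering what "load once, use concurrently" could look like, here is a rough sketch under the assumption that the read-only model data can be separated from per-run mutable state. `Model`, `State`, and `transcribe_all` are hypothetical stand-ins written with the standard library only; they are not the whisper-rs API, whose actual names and signatures may differ.

```rust
use std::sync::Arc;
use std::thread;

// Hypothetical stand-in for read-only model weights, loaded exactly once.
struct Model {
    name: String,
}

// Hypothetical stand-in for per-run scratch state; each thread owns one.
struct State {
    model: Arc<Model>,
    samples_seen: usize,
}

impl Model {
    fn load(name: &str) -> Arc<Model> {
        Arc::new(Model { name: name.to_string() })
    }
}

impl State {
    fn new(model: Arc<Model>) -> State {
        State { model, samples_seen: 0 }
    }

    // Mutates only this thread's state; the shared model is read-only.
    fn run(&mut self, audio: &[f32]) -> String {
        self.samples_seen += audio.len();
        format!("{}: {} samples", self.model.name, self.samples_seen)
    }
}

// Shares one model across threads; every thread creates its own State.
fn transcribe_all(model: &Arc<Model>, files: Vec<Vec<f32>>) -> Vec<String> {
    let handles: Vec<_> = files
        .into_iter()
        .map(|audio| {
            let model = Arc::clone(model);
            thread::spawn(move || {
                let mut state = State::new(model);
                state.run(&audio)
            })
        })
        .collect();

    handles
        .into_iter()
        .map(|h| h.join().expect("worker thread panicked"))
        .collect()
}
```

Cloning the `Arc` copies only a pointer, so the weights stay in memory once, while the mutable buffers that caused trouble when a single context was shared live in per-thread `State` values.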