Primary language: C++. License: MIT.

Whisper Vamp Plugin

The Whisper plugin is an implementation of OpenAI's Whisper speech recognition model as a Vamp plugin. It transcribes the speech in the audio stream and generates markers corresponding to the tokens (words and/or syllables) found. The lightweight ggml-tiny model is embedded in the plugin, so you don’t have to download anything to start experimenting, but you can download and use other models that may better suit your needs.
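To illustrate what "markers corresponding to the tokens" can look like as data, here is a minimal C++ sketch. It is not taken from the plugin's source: the `TimedToken` and `Marker` types, the `tokensToMarkers` function, and the 10 ms tick unit are all hypothetical and chosen only for illustration.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical token record: recognized text plus start/end times
// expressed in ticks (assumed here to be 10 ms each).
struct TimedToken {
    std::string text;
    std::int64_t startTicks;
    std::int64_t endTicks;
};

// Hypothetical marker: a labelled time span in seconds, similar in spirit
// to what an analysis track would display.
struct Marker {
    double time;
    double duration;
    std::string label;
};

// Converts timed tokens to markers, skipping empty tokens and tokens
// whose end time precedes their start time.
std::vector<Marker> tokensToMarkers(const std::vector<TimedToken>& tokens,
                                    double secondsPerTick = 0.01)
{
    std::vector<Marker> markers;
    markers.reserve(tokens.size());
    for (const auto& token : tokens) {
        if (token.text.empty() || token.endTicks < token.startTicks) {
            continue;
        }
        markers.push_back({token.startTicks * secondsPerTick,
                           (token.endTicks - token.startTicks) * secondsPerTick,
                           token.text});
    }
    return markers;
}
```

Under these assumptions, a token "world" spanning ticks 60 to 110 would become a marker starting at about 0.6 s and lasting about 0.5 s. The actual plugin may represent and filter tokens differently.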

The Whisper Vamp Plugin has been designed for use in the free audio analysis application Partiels.

Installation

Download the Whisper Vamp plugin installation package for your operating system from the Releases section and run the installer.

Use

Launch the Partiels application. In a new or existing document, create a new analysis track with the Whisper plugin. Modify the model or the analysis parameters via the property window. Please refer to the manual available in the Releases section for further information.

Compilation

The build system is based on CMake. For example, to configure, build, and test a Release build with Ninja:

cmake . -B build -G Ninja -DCMAKE_BUILD_TYPE=Release
cmake --build build
ctest -C Release -VV --test-dir build

Credits