Whisper with Silero VAD

This notebook combines Whisper with a separate VAD. This improves long-form transcriptions, at the cost of possibly missing a few lines. It also attempts to filter out hallucinations and obvious mistranslations.

How to use

Click here to open the notebook in Google Colab.
Run the Setup Whisper cell.
Upload your input audio to either the runtime itself, Google Drive, or a file hosting service with direct download links.
Set the audio_path and language variables, and then run the Run Whisper cell. (Note: Audio path is set automatically if you use the Upload cell)
Once it's done, the notebook will automatically download the generated SRT file.

ANonEntity/WhisperWithVAD

Whisper with Silero VAD

How to use