/whisper_transcriber

Primary LanguagePythonApache License 2.0Apache-2.0

OpenAI Whisper Transcriber

Introduction

This is a demo showing how to transcribe an .m4a audio file using OpenAI Whisper.

whisper-large model card

Pre-requisites

  • Ubuntu 20.04 LTS Focal
  • ffmpeg 4.2.7

Python

  • pytorch == 2.0.1
  • transformers == 4.29.2

See requirements.txt

Proxy usage

Please set REQUESTS_CA_BUNDLE according. E.g.

REQUESTS_CA_BUNDLE="/home/<user>/certs/cacert.pem"

Preprocessing

The audio sample needs to be downsampled into 16kHz, and .WAV format. FFMPEG can be used as follows:

ffmpeg -i magnets.m4a -ar 16000 magnets16K.wav

License

Apache 2.0