/video-to-text

A simple example of converting the audio track of an MPEG-4 video to text. It uses ffmpeg to extract the audio track from an mp4 file, runs speech recognition on it with the Vosk API and a Vosk model, and returns the recognized text. A utility for calculating metrics against a reference text is included.
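The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code. The function names (`ffmpeg_command`, `transcribe`, `word_error_rate`), the 16 kHz mono sample rate, and the chunk size are all assumptions. The description does not say which metrics the utility computes, so word error rate is used here only as a common choice for comparing a transcript with a reference text.

```python
import json
import subprocess

SAMPLE_RATE = 16000  # assumption: 16 kHz mono, a common input format for Vosk models


def ffmpeg_command(video_path, sample_rate=SAMPLE_RATE):
    """Build an ffmpeg command that writes the audio track to stdout as raw PCM."""
    return [
        "ffmpeg", "-loglevel", "quiet", "-i", video_path,
        "-vn",                    # drop the video stream
        "-ac", "1",               # downmix to mono
        "-ar", str(sample_rate),  # resample to the target rate
        "-f", "s16le", "-",       # 16-bit little-endian PCM on stdout
    ]


def transcribe(video_path, model_path, sample_rate=SAMPLE_RATE):
    """Stream the audio of video_path through a Vosk recognizer and return the text."""
    # Imported here so the rest of the sketch works without vosk installed.
    from vosk import Model, KaldiRecognizer

    recognizer = KaldiRecognizer(Model(model_path), sample_rate)
    pieces = []
    with subprocess.Popen(ffmpeg_command(video_path, sample_rate),
                          stdout=subprocess.PIPE) as proc:
        while True:
            chunk = proc.stdout.read(4000)  # hypothetical chunk size
            if not chunk:
                break
            # AcceptWaveform returns True when an utterance has been finalized.
            if recognizer.AcceptWaveform(chunk):
                pieces.append(json.loads(recognizer.Result())["text"])
    pieces.append(json.loads(recognizer.FinalResult())["text"])
    return " ".join(p for p in pieces if p)


def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference text must contain at least one word")
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, ref_word in enumerate(ref, 1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, 1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)
```

With ffmpeg on the PATH and a Vosk model unpacked locally, usage might look like `text = transcribe("talk.mp4", "path/to/vosk-model")` followed by `word_error_rate(reference_text, text)`; both paths here are placeholders.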

Primary Language: Python
