RosBusinessConsulting/video-to-text
Simple example of mpeg4 audio track to text conversion. Uses ffmpeg to cut the audio track from mp4 file, performs speech recognition via Vosk API and Vosk model and returns text as result. A utility for calculating metrics based on the reference text is included.
Python
No issues in this repository yet.