Look at SenseVoice SRT
lhl opened this issue · 1 comments
lhl commented
Makes some big claims for latency, WER - has a bunch of ASR features built in:
https://github.com/FunAudioLLM/SenseVoice
lhl commented
Added implementation but it calls out to ffmpeg and fails for some reason (ffmpeg on the tmpfile has no problem reading it)