lhl/voicechat2

Look at SenseVoice SRT

lhl opened this issue · 1 comments

lhl commented

Makes some big claims for latency, WER - has a bunch of ASR features built in:
https://github.com/FunAudioLLM/SenseVoice

lhl commented

Added implementation but it calls out to ffmpeg and fails for some reason (ffmpeg on the tmpfile has no problem reading it)