speechllm

There are 5 repositories under speechllm topic.

modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python13.4k 93 1.6k1.4k
FireRedTeam/FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
Language:Python1.6k 17 95138
aidayang/FunASR-OneClick
FunASR实时语音识别版，识别麦克风和电脑内播放的声音，电脑语音打字软件
12 2 00
PigeonDan1/ps-slm
TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks
Language:Python10
SALT-Research/SHALLOW
SHALLOW, the first hallucination benchmark for ASR models
Language:Python