speechllm

There are 5 repositories under speechllm topic.

  • modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

    Language:Python13.4k931.6k1.4k
  • FireRedTeam/FireRedASR

    Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.

    Language:Python1.6k1795138
  • aidayang/FunASR-OneClick

    FunASR实时语音识别版,识别麦克风和电脑内播放的声音,电脑语音打字软件

  • PigeonDan1/ps-slm

    TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks

    Language:Python10
  • SALT-Research/SHALLOW

    SHALLOW, the first hallucination benchmark for ASR models

    Language:Python