speech-language-model
There are 6 repositories under speech-language-model topic.
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
zhenye234/xcodec
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
hhguo/SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
slp-rl/salmon
The official code for the SALMonš£ benchmark
lucadellalib/audiocodecs
A collections of audio codecs with a standardized API