qiansichong

qiansichong's Stars

homebrewltd/ichigo
Llama3.1 learns to Listen
Language:Python42920
yangdongchao/RSTnet
Real-time Speech-Text Foundation Model Toolkit (wip)
Language:Python10410
xinchen-ai/Westlake-Omni
Language:Python955
wwbin2017/bailing
百聆是一个类似GPT-4o的语音对话机器人，通过ASR+LLM+TTS实现，时延低至800ms，低配置也可运行，支持打断
Language:Python173
kyutai-labs/moshi
Language:Python6.3k476
hsiehjackson/ASR-wav2vec2.0
This repo is for zh-TW ASR with wav2vec2.0.
Language:HTML41
OpenMOSS/AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
Language:Python76061
thu-coai/CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Language:Python1.8k255
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
Language:Python1.3k82
YouTaoBaBa/Chinese-Dialogue-Dataset
用于汇总目前的开源中文对话数据集
1008
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.4k679
karpathy/LLM101n
LLM101n: Let's build a Storyteller
29.4k1.6k
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python5.5k569
Chivier/easy-gpt4o
Easy-GPT4O opensource version
Language:Python668
svpino/alloy-voice-assistant
Language:Python867245
panyanyany/Awesome-ChatTTS
ChatTTS资源大全，免费体验地址，音色库等
1.2k88
BasedHardware/OpenGlass
Turn any glasses into AI-powered smart glasses
Language:C3.3k410
6drf21e/ChatTTS_colab
🚀 一键部署（含离线整合包）！基于 ChatTTS ，支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用，无需复杂安装。
Language:Python2k253
libukai/Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
1.1k73
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python31.5k3.4k
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.2k409
wdndev/llama3-from-scratch-zh
从零实现一个 llama3 中文版
Language:Jupyter Notebook49255
sonos/keyword-spotting-research-datasets
11621
aishoot/Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
Language:Jupyter Notebook381102
MarkFzp/act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
Language:Python3k554
DavidDiazGuerra/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Language:Cuda48394
GAMMA-UMD/pygsound
Impulse response generation based on state-of-the-art geometric sound propagation engine.
Language:C++14321
noahzhy/NSNet2
TF, PyTorch implementation of the paper NSNet2
Language:Python6
Okrio/CRUSE
a lightweight network for monaural speech enhancement
Language:Python4910
crlandsc/BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
Language:Python3