tuocheng0824

tuocheng0824's Stars

yuan1615/AdaVocoder
Adaptive Vocoder for Custom Voice
Language:Python5910
Ryuk17/SpeechAlgorithms
You can find the speech algorithms you want here
Language:C761246
AGENDD/RWKV-ASR
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the idea of SLAM_ASR and used the RWKV language model as the LLM, and instead of directly writing a prompt template we directly finetuned the initial state of the RWKV model.
Language:Python323
RicherMans/Dasheng
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
Language:Python443
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python12.7k1.1k
WenzheLiu-Speech/awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
1.1k221
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python6.5k700
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.5k318
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Language:Python59453
modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Language:Python3.8k410
yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
Language:C892145
wenet-e2e/wesignal
Production first, nn-based on-device signal processing toolkit.
643
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook36.2k4.3k
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language:Python7k723
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python7.1k755
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python71.9k8.5k
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python4.2k1.1k
wenet-e2e/wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
Language:Python37260
hankcs/HanLP
中文分词词性标注命名实体识别依存句法分析成分句法分析语义依存分析语义角色标注指代消解风格转换语义相似度新词发现关键词短语提取自动摘要文本分类聚类拼音简繁转换自然语言处理
Language:Python34k10.2k
alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Language:Python934249
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:Jupyter Notebook8.2k1.1k
Snowdar/asv-subtools
An Open Source Tools for Speaker Recognition
Language:Python602134
wildwolf1994411/VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
Language:Python1
llp1992/MachineLearning
Language:Matlab155116