ChiRenjun

ChiRenjun's Stars

lovemefan/SenseVoice.cpp
Port of Funasr's Sense-voice model in C/C++
Language:C1078
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python3.1k331
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.8k1.1k
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python4.1k1.1k
Yuan-ManX/ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
47233
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.2k482
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
Language:Python1.9k225
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
1983
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.1k66
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Language:Python1.9k151
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C34.8k3.5k
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.4k105
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python2.8k265
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python5.2k534
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++65.7k9.4k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.7k4.1k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python68.3k8.1k
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python11.6k969
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python4.3k430
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python33.4k3.8k
mkunes/w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
Language:Jupyter Notebook326
yeyupiaoling/MASR
Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。
Language:Python596106
ChristopherGS/ultimate-fastapi-tutorial
The Ultimate FastAPI Tutorial
Language:Python1k348
UnicomAI/Unichat-llama3-Chinese
Language:Python34434
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.2k657
modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.1k95
LlamaFamily/Llama-Chinese
Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用
Language:Python13.7k1.2k
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
1.6k225
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language:Jupyter Notebook3.4k286
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6k757