zzpDapeng's Stars
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
kyutai-labs/moshi
VITA-MLLM/VITA
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
audiolabs/rir-generator
thunlp/duplex-model
DefTruth/lite.ai.toolkit
🛠 A lite C++ toolkit of 100+ Awesome AI models, support ORT, MNN, NCNN, TNN and TensorRT. 🎉🎉
tencent-ailab/3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
laekov/fastmoe
A fast MoE impl for PyTorch
aceimnorstuvwxz/toutiao-text-classfication-dataset
今日头条中文新闻(文本)分类数据集
JackHCC/Chinese-Text-Classification-PyTorch
中文文本分类任务,基于PyTorch实现(TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention, DPCNN, Transformer,Bert,ERNIE),开箱即用!
MagicHub-io/CSASR_Challenge
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
ilius/pyglossary
A tool for converting dictionary files aka glossaries. Mainly to help use our offline glossaries in any Open Source dictionary we like on any modern operating system / device.
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
jctian98/e2e_lfmmi
E2E system with LF-MMI; word N-gram for Mandarin
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
pcottle/learnGitBranching
An interactive git visualization and tutorial. Aspiring students of git can use this app to educate and challenge themselves towards mastery of git!
zycv/awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Jarrettluo/ADAS_Evaluation_Launcher
wenet-e2e/wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
wenet-e2e/WenetSpeech
A 10000+ hours dataset for Chinese speech recognition
alibaba/Alibaba-MIT-Speech
Alibaba speech technology
tencent-ailab/pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
yistLin/dvector
Speaker embedding (d-vector) trained with GE2E loss
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
maum-ai/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system