lhanzl

Pinned Repositories

emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python619 15 4343
sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
Language:C++3.5k 48 521411
awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
02
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python00
WeTextProcessing
Text Normalization & Inverse Text Normalization
Language:Python0 0 00
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python6.8k 64 1.2k717
pipecat
Open Source framework for voice and multimodal conversational AI
Language:Python3.3k 26 141310

lhanzl's Repositories

lhanzl/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
02
lhanzl/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python00
lhanzl/WeTextProcessing
Text Normalization & Inverse Text Normalization
Language:Python0 0 00