wanghuii1

HangZhou, China

Pinned Repositories

LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python30.2k 193 4.7k3.7k
SMMA-Net
21
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python1.1k 17 9092
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python5.8k 57 1.1k624
FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Language:Python3.2k 31 84335
TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Language:Python294 7 6667
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python0 0 00
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. ｜语音识别工具包，包含丰富的性能优越的开源预训练模型，支持语音识别、语音端点检测、文本后处理等，具备服务部署能力。
Language:Python0 0 00
FunASR-APP
Applications based on speech related models from FunASR (Modelscope).
Language:Python0 0 00
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language:Python0 0 00

wanghuii1/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python0 0 00
wanghuii1/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. ｜语音识别工具包，包含丰富的性能优越的开源预训练模型，支持语音识别、语音端点检测、文本后处理等，具备服务部署能力。
Language:Python0 0 00
wanghuii1/FunASR-APP
Applications based on speech related models from FunASR (Modelscope).
Language:Python0 0 00
wanghuii1/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
Language:Python0 0 00