kangkangucas

kangkangucas's Stars

FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python9.6k935
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Language:Python16.8k2.4k
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python8.1k618
microsoft/NeuralSpeech
Language:Python1.4k180
AlonzoLeeeooo/awesome-video-generation
A collection of awesome video generation studies.
Language:TeX42715
mohuangrui/ucasthesis
LaTeX Thesis Template for the University of Chinese Academy of Sciences
Language:TeX3.5k937
unlock-music/unlock-music
Unlock encrypted music file in browser. 在浏览器中解锁加密的音乐文件。
13.2k164
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
1.8k230