kangkangucas's Stars
FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
microsoft/NeuralSpeech
AlonzoLeeeooo/awesome-video-generation
A collection of awesome video generation studies.
mohuangrui/ucasthesis
LaTeX Thesis Template for the University of Chinese Academy of Sciences
unlock-music/unlock-music
Unlock encrypted music files in the browser.
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).