Pinned Repositories
OpenGFW
OpenGFW is a flexible, easy-to-use, open-source implementation of GFW (Great Firewall of China) on Linux.
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
GPT-SoVITS-VC
Voice conversion (VC) without retraining!
huangxu1991.github.io
huangxu blog
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
RAM-multiprocess-dataloader
Demystify RAM Usage in Multi-Process Data Loaders
speechnas
SpeechNAS: Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
GPT-SoVITS
Even 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
huangxu1991's Repositories
huangxu1991/GPT-SoVITS-VC
Voice conversion (VC) without retraining!
huangxu1991/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
huangxu1991/huangxu1991.github.io
huangxu blog
huangxu1991/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
huangxu1991/RAM-multiprocess-dataloader
Demystify RAM Usage in Multi-Process Data Loaders
huangxu1991/speechnas
SpeechNAS: Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification