HaiFengZeng's Stars
yxlllc/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
w-okada/voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
revsic/torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
bloc97/Anime4K
A High-Quality Real Time Upscaler for Anime Video
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
yangdongchao/InstructTTS
The deme page of InstructTTS
microsoft/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
gnobitab/RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
kivy/kivy
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
BlinkDL/Hua
Hua is an AI image editor with Stable Diffusion (and more).
dunky11/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
thu-coai/CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
Tongjilibo/bert4torch
An elegent pytorch implement of transformers
HFrost0/bilix
⚡️Lightning-fast async download tool for bilibili and more
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
KunZhou9646/Mixed_Emotions
mli/autocut
用文本编辑器剪视频
MasayaKawamura/MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
tencent-ailab/bddm
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
matrixcascade/PainterEngine
PainterEngine is a application/game engine with software renderer,PainterEngine can be transplanted to any platform that supports C
matrixcascade/SoundLab
基于重采样,相位声码器及BP神经网络基音分类的变声器,数学,UI及信号处理算法基于PainterEngine开发