HaiFengZeng

HaiFengZeng's Stars

yxlllc/DDSP-SVC
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Language:Python1.8k243
w-okada/voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
Language:Python16.1k1.8k
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
Language:Python9.4k689
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python140k26.6k
revsic/torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
Language:Python14011
bloc97/Anime4K
A High-Quality Real Time Upscaler for Anime Video
Language:Jupyter Notebook18.3k1.3k
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python4.5k471
yangdongchao/InstructTTS
The deme page of InstructTTS
1558
microsoft/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Language:Python4.5k438
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language:Python2.9k416
gnobitab/RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
Language:Python84552
kivy/kivy
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
Language:Python17.6k3.1k
BlinkDL/Hua
Hua is an AI image editor with Stable Diffusion (and more).
35324
dunky11/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
Language:Python21832
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python12.5k847
thu-coai/CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Language:Python1.8k255
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
Language:Python2.4k197
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Language:Python4.4k451
Rongjiehuang/GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
Language:Python31345
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
Language:Python26886
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
Language:C++15k517
Tongjilibo/bert4torch
An elegent pytorch implement of transformers
Language:Python1.2k152
HFrost0/bilix
⚡️Lightning-fast async download tool for bilibili and more
Language:Python1.6k169
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Language:Python23k5.4k
KunZhou9646/Mixed_Emotions
Language:Python10311
mli/autocut
用文本编辑器剪视频
Language:Python6.6k658
MasayaKawamura/MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
Language:Python41764
tencent-ailab/bddm
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Language:Python21730
matrixcascade/PainterEngine
PainterEngine is a application/game engine with software renderer,PainterEngine can be transplanted to any platform that supports C
Language:C2.4k272
matrixcascade/SoundLab
基于重采样,相位声码器及BP神经网络基音分类的变声器,数学,UI及信号处理算法基于PainterEngine开发
Language:C15631