pzl1744

pzl1744's Stars

2dust/v2rayN
A GUI client for Windows, Linux and macOS, support Xray core and sing-box-core and others
Language:C#72.9k 744 5.2k11.9k
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
Language:Python28.3k 153 6252.8k
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python7k 55 2071.3k
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Language:Python4.4k 43 103718
williamyang1991/VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
Language:Jupyter Notebook3.6k 62 76449
yoyo-nb/Thin-Plate-Spline-Motion-Model
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Language:Jupyter Notebook3.5k 63 91556
project-baize/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
Language:Python3.2k 49 53287
openvpi/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Language:Python2.8k 36 106288
salu133445/musegan
An AI for Music Generation
Language:Python1.9k 50 129376
williamyang1991/DualStyleGAN
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
Language:Jupyter Notebook1.6k 28 103255
magenta/mt3
MT3: Multi-Task Multitrack Music Transcription
Language:Python1.5k 27 91195
apple/ml-neuman
Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)
Language:Python1.3k 35 98145
NVIDIA/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Language:Jupyter Notebook856 29 95182
marcoppasini/musika
Fast Infinite Waveform Music Generation
Language:Python668 23 3949
Jittor/JNeRF
JNeRF is a NeRF benchmark based on Jittor. JNeRF re-implemented instant-ngp and achieved same performance with original paper.
Language:C++643 18 5575
leimao/Voice-Converter-CycleGAN
Voice Converter Using CycleGAN and Non-Parallel Data
Language:Python525 12 36127
SforAiDl/Neural-Voice-Cloning-With-Few-Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
Language:Python432 31 22123
KinglittleQ/GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Language:Python366 14 1771
deterministic-algorithms-lab/Cross-Lingual-Voice-Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
Language:Jupyter Notebook358 18 1557
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language:Python323 10 2744
Sharad24/Neural-Voice-Cloning-with-Few-Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Language:Python253 14 455
AIFSH/NativeSpeaker
make your Speaker talking as Native style with own voice！
Language:Python250 14 562
CMsmartvoice/One-Shot-Voice-Cloning
:relaxed: One Shot Voice Cloning base on Unet-TTS
Language:Jupyter Notebook240 9 1740
Edresson/VoiceSplit
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
Language:Python234 8 1132
Seanseattle/StyleSwap
StyleSwap: Style-Based Generator Empowers Robust Face Swapping (ECCV 2022)
Language:Python207 38 1220
PlayVoice/VI-SVS
Singing Voice Synthesis based on VITS, different from VISinger
Language:Python186 8 1431
SMART-TTS/SMART-Single_Emotional_TTS
Language:Python97 2 438
foamliu/Tacotron2-Mandarin
PyTorch reimplementation of Tacotron2 in Mandarin
Language:Python81 4 1628
MingtaoGuo/StyleSwap
Unofficial implementation of the paper: StyleSwap: Style-Based Generator Empowers Robust Face Swapping
Language:Python50 10 47
zawawiAI/yolo_gpt
This is a GUI application that integrates YOLOv8 object recognition with OpenAI's GPT-3 language generation model.
Language:Python33 2 13