Pinned Repositories
callcenter
FastMaskRCNN
Mask RCNN in TensorFlow
multi-speaker-tacotron-tensorflow
Multi-speaker Tacotron in TensorFlow.
neural_chinese_transliterator
Can CNNs transliterate Pinyin into Chinese characters correctly?
normalizing-flows-tutorial
Tutorial on normalizing flows.
Pinyin2Hanzi
拼音转汉字, 拼音输入法引擎, pin yin -> 拼音
SING
Symbol-to-Instrument Neural Generator
tacotron2-2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
TTS-Cube
End-2-end speech synthesis with recurrent neural networks
wavernn
pytorch implement wavernn
maozhiqiang's Repositories
maozhiqiang/ttsGAN-ICLR2019
maozhiqiang/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
maozhiqiang/bana-tts
maozhiqiang/contentvec
speech self-supervised representations
maozhiqiang/Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
maozhiqiang/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
maozhiqiang/DJtransGAN
"Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022
maozhiqiang/DocProduct
Medical Q&A with Deep Language Models
maozhiqiang/FaceFormer
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
maozhiqiang/g2pM
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
maozhiqiang/hardware_introduction
What scienfitic programmers must know about CPUs and RAM to write fast code.
maozhiqiang/headliner
🏖 Easy training and deployment of seq2seq models.
maozhiqiang/KAN-TTS
maozhiqiang/LiveSpeechPortraits
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
maozhiqiang/melgan-neurips
maozhiqiang/MelGAN-VC
MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms
maozhiqiang/musika
Fast Infinite Waveform Music Generation
maozhiqiang/Neural-Style-Transfer-Audio
This is PyTorch Implementation Of Naural Style Transfer Algorithm which is modified for Audios.
maozhiqiang/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
maozhiqiang/prosody
Helsinki Prosody Corpus and System for Predicting Prosodic Prominence from Text
maozhiqiang/Real_Time_Image_Animation
The Project is real time application in opencv using first order model
maozhiqiang/shell_command
maozhiqiang/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
maozhiqiang/SpanPSP
maozhiqiang/state-spaces
Sequence Modeling with Structured State Spaces
maozhiqiang/StyleGAN2
maozhiqiang/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
maozhiqiang/TrWebOCR
开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~
maozhiqiang/U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
maozhiqiang/WindTerm
A quicker and better cross-platform SSH/Sftp/Shell/Telnet/Serial client.