maozhiqiang

Pinned Repositories

callcenter
Language:Python1 1 01
FastMaskRCNN
Mask RCNN in TensorFlow
Language:Python1 2 00
multi-speaker-tacotron-tensorflow
Multi-speaker Tacotron in TensorFlow.
Language:Python1 2 00
neural_chinese_transliterator
Can CNNs transliterate Pinyin into Chinese characters correctly?
Language:Python1 2 00
normalizing-flows-tutorial
Tutorial on normalizing flows.
Language:Jupyter Notebook1 1 00
Pinyin2Hanzi
拼音转汉字，拼音输入法引擎， pin yin -> 拼音
Language:Python2 1 00
SING
Symbol-to-Instrument Neural Generator
Language:Python10
tacotron2-2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook3 2 00
TTS-Cube
End-2-end speech synthesis with recurrent neural networks
Language:Python1 1 00
wavernn
pytorch implement wavernn
Language:Python7 2 03

maozhiqiang's Repositories

maozhiqiang/ttsGAN-ICLR2019
Language:Python1 1 0
maozhiqiang/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Language:Python1 0
maozhiqiang/bana-tts
Language:Jupyter Notebook1 0
maozhiqiang/contentvec
speech self-supervised representations
Language:Python1 0
maozhiqiang/Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Jupyter Notebook1 0
maozhiqiang/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
Language:Python1 0
maozhiqiang/DJtransGAN
"Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022
maozhiqiang/DocProduct
Medical Q&A with Deep Language Models
Language:Jupyter Notebook2 0
maozhiqiang/FaceFormer
[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers
maozhiqiang/g2pM
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Language:Python2 01
maozhiqiang/hardware_introduction
What scienfitic programmers must know about CPUs and RAM to write fast code.
Language:Jupyter Notebook1 0
maozhiqiang/headliner
🏖 Easy training and deployment of seq2seq models.
Language:Python2 0
maozhiqiang/KAN-TTS
maozhiqiang/LiveSpeechPortraits
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
maozhiqiang/melgan-neurips
Language:Python2 0
maozhiqiang/MelGAN-VC
MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms
Language:Jupyter Notebook1 0
maozhiqiang/musika
Fast Infinite Waveform Music Generation
Language:Python1 0
maozhiqiang/Neural-Style-Transfer-Audio
This is PyTorch Implementation Of Naural Style Transfer Algorithm which is modified for Audios.
Language:Python2 0
maozhiqiang/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Language:Jupyter Notebook2 0
maozhiqiang/prosody
Helsinki Prosody Corpus and System for Predicting Prosodic Prominence from Text
Language:Python1
maozhiqiang/Real_Time_Image_Animation
The Project is real time application in opencv using first order model
Language:Python1 0
maozhiqiang/shell_command
1 0
maozhiqiang/SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
Language:Python1 0
maozhiqiang/SpanPSP
maozhiqiang/state-spaces
Sequence Modeling with Structured State Spaces
maozhiqiang/StyleGAN2
maozhiqiang/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
maozhiqiang/TrWebOCR
开源易用的中文离线OCR，识别率媲美大厂，并且提供了易用的web页面及web的接口，方便人类日常工作使用或者其他程序来调用~
Language:Python1 0
maozhiqiang/U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Language:Python1 0
maozhiqiang/WindTerm
A quicker and better cross-platform SSH/Sftp/Shell/Telnet/Serial client.
Language:C1 0