GaryGao99's Stars
NVIDIA/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
pshved/timeout
A script to measure and limit CPU time and memory consumption of black-box processes in Linux
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
momstouch/tdnn_tensorflow
TDNN (time delay neural network) TensorFlow implementation
bliunlpr/DeepSpeaker_PyTorch
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
syang1993/gst-tacotron
A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
xcmyz/FastSpeech
An implementation of FastSpeech based on PyTorch.
upx/upx
UPX - the Ultimate Packer for eXecutables
b04901014/FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
jjery2243542/adaptive_voice_conversion
resemble-ai/Resemblyzer
A Python package to analyze and compare voices with deep learning
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
bojone/flow
Keras implementation of flow-based models
ShivamRajSharma/Transformer-Architectures-From-Scratch
Implementation of transformer-based architectures in PyTorch.
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
JeremyCCHsu/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Kyubyong/g2p
g2p: English Grapheme To Phoneme Conversion
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
WeidiXie/VGG-Speaker-Recognition
Utterance-level Aggregation For Speaker Recognition In The Wild
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Azure-Samples/cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
YoavRamon/awesome-kaldi
A list of features, scripts, blogs and resources for making better use of Kaldi (http://kaldi-asr.org/)
weiaicunzai/pytorch-cifar100
Practice on CIFAR-100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet, NasNet, Residual Attention Network, SENet, WideResNet)