Pinned Repositories
AdaptaBERT
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling
adaptive_voice_conversion
AdaSpeech2
AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
AGAIN-VC
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization.
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
automata-from-regex
A python program to build nfa, dfa and minimised DFA from given regular expression. Uses Tkinter for GUI and GraphViz for graphs.
Chinese-Hip-pop-Generation
Generate Chinese hip-pop lyrics using GAN
Cognitive-Speech-STT-Android
Android SDK for the Microsoft Speech-to-Text API, part of Cognitive Services
Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
EsOff's Repositories
EsOff/Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
EsOff/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
EsOff/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
EsOff/FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
EsOff/g2pM
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
EsOff/hinglishNorm
A Hindi-English Dataset for Text Normalization
EsOff/jack2
jack2 codebase
EsOff/KnowPrompt
Code and datasets for the WWW2022 paper "KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction"
EsOff/Learn-Vim
Learning Vim and Vimscript doesn't have to be hard. This is the guide that you're looking for 📖
EsOff/lyrebird
🦜 Simple and powerful voice changer for Linux, written in GTK 3.
EsOff/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
EsOff/mRASP
EsOff/multilingual-t5
EsOff/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
EsOff/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
EsOff/NeuralSpeech
EsOff/PainterEngine
PainterEngine is a application/game engine with software renderer,PainterEngine can be transplanted to any platform that supports C
EsOff/PodcastMix-inference
EsOff/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
EsOff/speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
EsOff/SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
EsOff/stanza
Official Stanford NLP Python Library for Many Human Languages
EsOff/StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
EsOff/text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
EsOff/textlesslib
Library for Textless Spoken Language Processing
EsOff/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
EsOff/VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
EsOff/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
EsOff/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
EsOff/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone