Pinned Repositories
AdaptaBERT
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling
adaptive_voice_conversion
AdaSpeech2
AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
AGAIN-VC
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization.
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
automata-from-regex
A python program to build nfa, dfa and minimised DFA from given regular expression. Uses Tkinter for GUI and GraphViz for graphs.
Chinese-Hip-pop-Generation
Generate Chinese hip-pop lyrics using GAN
Cognitive-Speech-STT-Android
Android SDK for the Microsoft Speech-to-Text API, part of Cognitive Services
Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS
control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
EsOff's Repositories
EsOff/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
EsOff/control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
EsOff/CoSoD
Collaborative Song Dataset
EsOff/De-limiter
An official repository of "Music De-limiter Networks via Sample-wise Gain Inversion", which will be presented in WASPAA 2023.
EsOff/DeepLearningExamples
Deep Learning Examples
EsOff/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
EsOff/espnet
End-to-End Speech Processing Toolkit
EsOff/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
EsOff/flashlight
A C++ standalone library for machine learning
EsOff/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
EsOff/icu
The new home of the ICU project source code.
EsOff/indonlu
The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)
EsOff/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
EsOff/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
EsOff/Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
EsOff/lll-tts
ICASSP 2022
EsOff/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
EsOff/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
EsOff/NeMo
NeMo: a toolkit for conversational AI
EsOff/nix-tts
š¤ Nix-TTS: An Incredibly Lightweight End-to-End Text-to-Speech Model via Non End-to-End Distillation
EsOff/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
EsOff/openvino
OpenVINOā¢ Toolkit repository
EsOff/phonemizer
Simple text to phones converter for multiple languages
EsOff/RE2NN-SEQ
Source code for the EMNLP2021 paper: "Neuraling Regular Expressions for Slot Filling"
EsOff/torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
EsOff/transformer_cpp_tokenizers
transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)
EsOff/vector-quantize-pytorch
Vector Quantization, in Pytorch
EsOff/VoiceMe
Repository for the paper: VoiceMe: Personalized voice generation in TTS
EsOff/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
EsOff/WeTextProcessing
Text Normalization & Inverse Text Normalization