Charlottecuc
The University of Edinburgh | Speech Synthesis | Voice Conversion | Automatic Speech Recognition | NLP
University of EdinburghUK
Pinned Repositories
adaptive_voice_conversion
ai-research-code
Algorithm_Interview_Notes-Chinese
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
AlignTTS
Implementation of the AlignTTS
anomaly-detection-resources
Anomaly detection related books, papers, videos, and toolboxes
Audio-Effects
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
Audio-Style-Transfer
Github repository for inzva-ai project Audio Style Transfer
audio_style_tranfer
autotuner
CatDCGAN
A DCGAN that generate Cat pictures 🐱💻
Charlottecuc's Repositories
Charlottecuc/ai-research-code
Charlottecuc/cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Charlottecuc/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Charlottecuc/CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Charlottecuc/dpss-exp3-VC-PPG
Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
Charlottecuc/editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Charlottecuc/efficient_tts
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
Charlottecuc/isobar
A Python library for creating and manipulating musical patterns, designed for use in algorithmic composition, generative music and sonification. Can be used to generate MIDI events, MIDI files, OSC messages, or custom events.
Charlottecuc/malaya-speech
Speech Toolkit for bahasa Malaysia, https://malaya-speech.readthedocs.io/
Charlottecuc/Meta-TTS
Charlottecuc/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Charlottecuc/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Charlottecuc/nnsvs
Neural network-based singing voice synthesis library for research
Charlottecuc/Notes
Some Markdown Notes...
Charlottecuc/OMGD
Online Multi-Granularity Distillation for GAN Compression (ICCV2021)
Charlottecuc/OSM-one-shot-multispeaker
Framework for one-shot multispeaker system based on Deep Learning
Charlottecuc/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Charlottecuc/project-NN-Pytorch-scripts
Charlottecuc/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Charlottecuc/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Charlottecuc/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Charlottecuc/stargan
StarGAN - Official PyTorch Implementation (CVPR 2018)
Charlottecuc/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Charlottecuc/StreamingCNN
To train deep convolutional neural networks, the input data and the activations need to be kept in memory. Given the limited memory available in current GPUs, this limits the maximum dimensions of the input data. Here we demonstrate a method to train convolutional neural networks while holding only parts of the image in memory.
Charlottecuc/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Charlottecuc/tuna
An audio effects library for the Web Audio API.
Charlottecuc/VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
Charlottecuc/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Charlottecuc/VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
Charlottecuc/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit