Charlottecuc

The University of Edinburgh | Speech Synthesis | Voice Conversion | Automatic Speech Recognition | NLP

University of EdinburghUK

Pinned Repositories

adaptive_voice_conversion
Language:Python0 1 01
ai-research-code
Language:Python0 1 00
Algorithm_Interview_Notes-Chinese
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
Language:Python1 1 00
AlignTTS
Implementation of the AlignTTS
Language:Jupyter Notebook0 1 00
anomaly-detection-resources
Anomaly detection related books, papers, videos, and toolboxes
Language:Python0 1 00
Audio-Effects
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
Language:C++0 1 00
Audio-Style-Transfer
Github repository for inzva-ai project Audio Style Transfer
Language:Python0 1 00
audio_style_tranfer
Language:Jupyter Notebook0 1 00
autotuner
Language:Python0 1 00
CatDCGAN
A DCGAN that generate Cat pictures 🐱‍💻
Language:Jupyter Notebook1 1 00

Charlottecuc's Repositories

Charlottecuc/ai-research-code
Language:Python0 1 00
Charlottecuc/cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Language:Python1 0
Charlottecuc/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language:Python1 0
Charlottecuc/CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Language:Python1 0
Charlottecuc/dpss-exp3-VC-PPG
Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
Language:Python1 0
Charlottecuc/editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Language:Python1 0
Charlottecuc/efficient_tts
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
Language:Python1 0
Charlottecuc/isobar
A Python library for creating and manipulating musical patterns, designed for use in algorithmic composition, generative music and sonification. Can be used to generate MIDI events, MIDI files, OSC messages, or custom events.
Language:Python1 0
Charlottecuc/malaya-speech
Speech Toolkit for bahasa Malaysia, https://malaya-speech.readthedocs.io/
Language:Jupyter Notebook1 0
Charlottecuc/Meta-TTS
Language:Python1 0
Charlottecuc/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:JavaScript1 0
Charlottecuc/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python1 0
Charlottecuc/nnsvs
Neural network-based singing voice synthesis library for research
Language:Python1 0
Charlottecuc/Notes
Some Markdown Notes...
Language:Jupyter Notebook1 0
Charlottecuc/OMGD
Online Multi-Granularity Distillation for GAN Compression (ICCV2021)
Language:Python1 0
Charlottecuc/OSM-one-shot-multispeaker
Framework for one-shot multispeaker system based on Deep Learning
Language:Python1 0
Charlottecuc/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Language:Python1 0
Charlottecuc/project-NN-Pytorch-scripts
Language:Jupyter Notebook1 0
Charlottecuc/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Language:Python1 0
Charlottecuc/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python1 0
Charlottecuc/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:Python1 0
Charlottecuc/stargan
StarGAN - Official PyTorch Implementation (CVPR 2018)
Language:Python1 0
Charlottecuc/StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Language:Python1 0
Charlottecuc/StreamingCNN
To train deep convolutional neural networks, the input data and the activations need to be kept in memory. Given the limited memory available in current GPUs, this limits the maximum dimensions of the input data. Here we demonstrate a method to train convolutional neural networks while holding only parts of the image in memory.
Language:Jupyter Notebook1 0
Charlottecuc/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Jupyter Notebook1 0
Charlottecuc/tuna
An audio effects library for the Web Audio API.
Language:JavaScript1 0
Charlottecuc/VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
Language:Python1 0
Charlottecuc/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python1 0
Charlottecuc/VQMIVC
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
Language:Python1 0
Charlottecuc/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:C++1 0