taalua

Pinned Repositories

AdvDCTTS
Implementation of DCTTS with Adversarial Training
Language:Python0 0 00
Adversarial-Many-to-Many-VC
Language:Python0 0 00
audio2vec
Language:Jupyter Notebook0 0 00
AudioStyleNet
This repository contains the code for my master thesis on Emotion-Aware Facial Animation
Language:Jupyter Notebook0 0 00
Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Language:Python0 0 00
clpcnet
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
Language:Python0 0 00
conformer
Implementation of the convolutional module from the Conformer paper, for use in Transformers
Language:Python0 0 00
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (submitted to ICASSP 2021, in PyTorch)
Language:Python0 0 00
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
Language:Python0 0 00
DNS-Challenge
This repo contains the scripts, models, and required files for the ICASSP 2021 Deep Noise Suppression (DNS) Challenge.
0 0 00

taalua's Repositories

taalua/AudioStyleNet
This repository contains the code for my master thesis on Emotion-Aware Facial Animation
Language:Jupyter Notebook0 0 00
taalua/Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Language:Python0 0 00
taalua/clpcnet
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
Language:Python0 0 00
taalua/editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Language:Python0 0
taalua/FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
Language:Python0 0
taalua/few-shot-transformer-tts
Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Language:Python0 0
taalua/FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
Language:Python0 0
taalua/flow_synthesizer
Universal audio synthesizer control learning with normalizing flows
Language:Max0 0
taalua/flowEQ
β-VAE for intelligent control of a five band parametric EQ
Language:MATLAB0 0
taalua/g2p
g2p: English Grapheme To Phoneme Conversion
Language:Python0 0
taalua/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Language:Python0 0
taalua/jax-variational-diffwave
Jax/Flax implementation of Variational-DiffWave.
Language:Python0 0
taalua/MaskCycleGAN-VC
Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.
Language:Python0 0
taalua/mir-svc
Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach
Language:Python0 0
taalua/mixture-of-experts
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
taalua/msaf
Music Structure Analysis Framework
Language:Python0 0
taalua/MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
Language:Python0 0
taalua/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
taalua/normalizing-flows
PyTorch implementation of normalizing flow models
Language:Python0 0
taalua/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Language:Jupyter Notebook0 0
taalua/SSL_Anti-spoofing
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
Language:Python0 0
taalua/ssqueezepy
Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python
Language:Python0 0
taalua/stereoEEG2speech
Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using the PyTorch Lightning.
Language:Python0 0
taalua/taalua
Config files for my GitHub profile.
1 0
taalua/tt-vae-gan
Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Language:Python0 0
taalua/voice_conversion
Language:Python0 0
taalua/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
Language:Python0 0
taalua/WaveGrad
Implementation of Google Brain's WaveGrad vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Language:Jupyter Notebook
taalua/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Language:Python0 0
taalua/x-vector-pytorch
Language:Python0 0