Pinned Repositories
AdvDCTTS
Implementation of DCTTS with Adversarial Training
Adversarial-Many-to-Many-VC
audio2vec
AudioStyleNet
This repository contains the code for my master thesis on Emotion-Aware Facial Animation
Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
clpcnet
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
conformer
Implementation of the convolutional module from the Conformer paper, for use in Transformers
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (submitted to ICASSP 2021, in PyTorch)
deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
DNS-Challenge
This repo contains the scripts, models, and required files for the ICASSP 2021 Deep Noise Suppression (DNS) Challenge.
taalua's Repositories
taalua/AudioStyleNet
This repository contains the code for my master thesis on Emotion-Aware Facial Animation
taalua/Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
taalua/clpcnet
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
taalua/editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
taalua/FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:
taalua/few-shot-transformer-tts
Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
taalua/FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
taalua/flow_synthesizer
Universal audio synthesizer control learning with normalizing flows
taalua/flowEQ
β-VAE for intelligent control of a five band parametric EQ
taalua/g2p
g2p: English Grapheme To Phoneme Conversion
taalua/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
taalua/jax-variational-diffwave
Jax/Flax implementation of Variational-DiffWave.
taalua/MaskCycleGAN-VC
Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.
taalua/mir-svc
Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach
taalua/mixture-of-experts
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
taalua/msaf
Music Structure Analysis Framework
taalua/MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presented at Interspeech 2021
taalua/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
taalua/normalizing-flows
PyTorch implementation of normalizing flow models
taalua/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
taalua/SSL_Anti-spoofing
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
taalua/ssqueezepy
Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python
taalua/stereoEEG2speech
Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using the PyTorch Lightning.
taalua/taalua
Config files for my GitHub profile.
taalua/tt-vae-gan
Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
taalua/voice_conversion
taalua/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
taalua/WaveGrad
Implementation of Google Brain's WaveGrad vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
taalua/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
taalua/x-vector-pytorch