pilarOG
Third-year PhD student in Speech Synthesis. I love machine learning, Python, and phonetics, and I want to give everybody the chance to learn them!
University of Edinburgh, Edinburgh, Scotland
pilarOG's Stars
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
vdumoulin/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
google-research/vision_transformer
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Separius/awesome-fast-attention
A list of efficient attention modules
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
jefflai108/Contrastive-Predictive-Coding-PyTorch
Contrastive Predictive Coding for Automatic Speaker Verification
Jackson-Kang/Pytorch-VAE-tutorial
A simple tutorial on Variational AutoEncoders with PyTorch
bminixhofer/nnsplit
Semantic text segmentation. For sentence boundary detection, compound splitting and more.
cmsflash/efficient-attention
An implementation of the efficient attention module.
swasun/VQ-VAE-Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
WasifurRahman/BERT_multimodal_transformer
jinhan/tacotron2-vae
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
janfreyberg/pytorch-revgrad
A minimal PyTorch package implementing a gradient reversal layer.
diegma/graph-2-text
Graph-to-sequence model implemented in PyTorch, combining graph convolutional networks and OpenNMT-py
kylebgorman/swipe
A pitch tracker using Camacho's SWIPE' algorithm, written in C
eemlcommunity/PracticalSessions2021
Emotional-Text-to-Speech/hmm-for-emo-tts
A repository with comprehensive instructions for using the Festvox toolkit to generate emotional speech from text
CSTR-Edinburgh/qualtreats
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
sagorbrur/codeswitch
CodeSwitch is an NLP tool that can be used for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed data.
laic/uoe_speech_processing_course
BorealisAI/cross_domain_coherence
A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912
jbeliao/SLAM
taasnim/unified-coherence-model
timmahrt/LMEDS
Language Markup and Experimental Design Software -- for running experiments over the internet
fievelk/pylade
PyLaDe - Language Detection tool.