pilarOG

Third year PhD student in Speech Synthesis. I love machine learning, Python and phonetics and I want to give everybody the chance to learn them!

University of EdinburghEdinburgh, Scotland

pilarOG's Stars

facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.5k 426 4.2k6.4k
vdumoulin/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
Language:TeX14.1k 343 312.3k
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Jupyter Notebook13.5k 296 8433.2k
google-research/vision_transformer
Language:Jupyter Notebook10.4k 105 2071.3k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.9k 135 1.1k1.4k
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.3k 71 993776
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python1.7k 36 149302
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python1.3k 36 714247
Separius/awesome-fast-attention
list of efficient attention modules
Language:Python990 32 3108
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language:Python734 21 78101
jefflai108/Contrastive-Predictive-Coding-PyTorch
Contrastive Predictive Coding for Automatic Speaker Verification
Language:Python480 4 2198
Jackson-Kang/Pytorch-VAE-tutorial
A simple tutorial of Variational AutoEncoders with Pytorch
Language:Jupyter Notebook328 3 476
bminixhofer/nnsplit
Semantic text segmentation. For sentence boundary detection, compound splitting and more.
Language:Rust301 8 3225
cmsflash/efficient-attention
An implementation of the efficient attention module.
Language:Python283 6 1326
swasun/VQ-VAE-Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Language:Python263 13 553
WasifurRahman/BERT_multimodal_transformer
Language:Python194 8 2030
jinhan/tacotron2-vae
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
Language:Jupyter Notebook166 10 633
janfreyberg/pytorch-revgrad
A minimal pytorch package implementing a gradient reversal layer.
Language:Python155 3 414
diegma/graph-2-text
Graph to sequence implemented in Pytorch combining Graph convolutional networks and opennmt-py
Language:Python151 8 1328
kylebgorman/swipe
A pitch tracker using Camacho's SWIPE' algorithm, written in C
Language:C85 9 826
eemlcommunity/PracticalSessions2021
Language:Jupyter Notebook62 9 035
Emotional-Text-to-Speech/hmm-for-emo-tts
:computer: A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech :speaker: from text
Language:CSS46 2 110
CSTR-Edinburgh/qualtreats
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
Language:Python31 10 615
sagorbrur/codeswitch
CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed data.
Language:Jupyter Notebook31 4 06
laic/uoe_speech_processing_course
Language:Jupyter Notebook28 7 020
BorealisAI/cross_domain_coherence
A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912
Language:Python24 5 15
jbeliao/SLAM
Language:Python16 5 04
taasnim/unified-coherence-model
Language:Python15 2 19
timmahrt/LMEDS
Language Markup and Experimental Design Software -- for running experiments over the internet
Language:Python12 4 26
fievelk/pylade
PyLaDe - Language Detection tool.
Language:Python5 1 01

pilarOG

pilarOG's Stars

facebookresearch/fairseq

vdumoulin/conv_arithmetic

NVIDIA/DeepLearningExamples

google-research/vision_transformer

speechbrain/speechbrain

pyannote/pyannote-audio

facebookresearch/denoiser

MontrealCorpusTools/Montreal-Forced-Aligner

Separius/awesome-fast-attention

nlp-uoregon/trankit

jefflai108/Contrastive-Predictive-Coding-PyTorch

Jackson-Kang/Pytorch-VAE-tutorial

bminixhofer/nnsplit

cmsflash/efficient-attention

swasun/VQ-VAE-Speech

WasifurRahman/BERT_multimodal_transformer

jinhan/tacotron2-vae

janfreyberg/pytorch-revgrad

diegma/graph-2-text

kylebgorman/swipe

eemlcommunity/PracticalSessions2021

Emotional-Text-to-Speech/hmm-for-emo-tts

CSTR-Edinburgh/qualtreats

sagorbrur/codeswitch

laic/uoe_speech_processing_course

BorealisAI/cross_domain_coherence

jbeliao/SLAM

taasnim/unified-coherence-model

timmahrt/LMEDS

fievelk/pylade