Pinned Repositories
awesome-bandwidth-expansion
This is an attempt to list interesting audio bandwidth expansion/super-resolution research works.
awesome-speaker-recognition
This is an attempt to list interesting speaker recognition research works.
awesome-speech-synthesis
This is an attempt to list interesting voice conversion works.
PERL-samples
Enhanced samples from PERL-AE model (https://arxiv.org/abs/2010.11860)
projected-distribution-loss
My implementation of Projected Distribution Loss (PDL)
saurabh-kataria's Repositories
saurabh-kataria/9-jhu
saurabh-kataria/complex_tf
saurabh-kataria/deep_complex_networks
Implementation related to the Deep Complex Networks
saurabh-kataria/Face-Detection-Tracking-and-Clustering
We detect and track faces in video, then extract features from those face tracks and try to cluster them into a given number of persons.
saurabh-kataria/handson-ml
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn and TensorFlow.
saurabh-kataria/improvedsegan
This repository extends SEGAN, a GAN-based speech enhancement model, with two modifications that make model training more robust and stable.
saurabh-kataria/irasl2018
Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"
saurabh-kataria/isegan
Improved Speech Enhancement GANs
saurabh-kataria/jsalt-2019-mt-tutorial
MT Tutorial for the JSALT 2019 Summer School
saurabh-kataria/JSALT19-GluonNLP
JSALT 2019 Montréal: Dive into Deep Learning for Natural Language Processing
saurabh-kataria/jsalt2019-diadet
Repository of recipes for the JSALT2019 workshop on "Speaker Detection in Adverse Scenarios with a Single Microphone"
saurabh-kataria/latex-resumes
A collection of LaTeX resume templates
saurabh-kataria/Matlab-toolbox-for-DNN-based-speech-separation
This folder contains Matlab programs for a toolbox for supervised speech separation using deep neural networks (DNNs).
saurabh-kataria/PRMLT
Pattern Recognition and Machine Learning Toolbox
saurabh-kataria/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
saurabh-kataria/Quadruplets-Network
Implementation of the Quadruplets Network and Quadruplets Loss as described in "Beyond triplet loss: a deep quadruplet network for person re-identification".
saurabh-kataria/RawNet
saurabh-kataria/rnn-speech-denoising
Recurrent neural network training for noise reduction in robust automatic speech recognition
saurabh-kataria/SEGAN-1
A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"
saurabh-kataria/segan-pytorch
SEGAN PyTorch implementation (https://arxiv.org/abs/1703.09452)
saurabh-kataria/segan-tfworked
Speech Enhancement Generative Adversarial Network
saurabh-kataria/segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
saurabh-kataria/speech-denoising-wavenet
A neural network for end-to-end speech denoising
saurabh-kataria/SpeechDenoisingWithDeepFeatureLosses
Speech Denoising with Deep Feature Losses
saurabh-kataria/speedtest-cli
Command line interface for testing internet bandwidth using speedtest.net
saurabh-kataria/tensorflow-workshop
This repo contains materials for use in a TensorFlow workshop.
saurabh-kataria/UGATIT
Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation
saurabh-kataria/unimatrix
Python script to simulate the display from "The Matrix" in a terminal. Uses half-width katakana Unicode characters by default, but can use custom character sets. Accepts keyboard controls while running. Based on CMatrix.
saurabh-kataria/Wave-U-Net-For-Speech-Enhancement
Improved speech enhancement with the Wave-U-Net. Applying Stoller et al.'s deep convolutional neural network architecture to speech enhancement in the time domain.