sravanidn
I am an ML researcher and engineer at Comcast Labs with a focus on NLP, speech processing, and human-machine interfaces.
ML Researcher at Comcast Labs
San Francisco, California
Pinned Repositories
audacity
Audio Editor
awesome-object-detection
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
coursera-gan-specialization
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
FARM
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
LifeIns_Modling_Buy_NoBuy
pytorch_geometric
Geometric Deep Learning Extension Library for PyTorch
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Tacotron-2
DeepMind's Tacotron-2 TensorFlow implementation
sravanidn's Repositories
sravanidn/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
sravanidn/FARM
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
sravanidn/pytorch_geometric
Geometric Deep Learning Extension Library for PyTorch
sravanidn/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
sravanidn/audacity
Audio Editor
sravanidn/awesome-object-detection
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
sravanidn/coursera-gan-specialization
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
sravanidn/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
sravanidn/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
sravanidn/LifeIns_Modling_Buy_NoBuy
sravanidn/Tacotron-2
DeepMind's Tacotron-2 TensorFlow implementation
sravanidn/awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
sravanidn/ba
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
sravanidn/computer-science
🎓 Path to a free self-taught education in Computer Science!
sravanidn/deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
sravanidn/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
sravanidn/Fuji_Data
sravanidn/go-figure-kubernetes
Kubernetes environment for running go figure apps
sravanidn/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
sravanidn/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitch Fix, etc.
sravanidn/ml-system-design-pattern
System design patterns for machine learning
sravanidn/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
sravanidn/overview
Description-FAQ of the process
sravanidn/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
sravanidn/pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian)
sravanidn/stylegan2-training
Materials for StyleGAN2 Training class
sravanidn/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
sravanidn/talking-jobs
sravanidn/TTS-Style-Transfer
Official PyTorch implementation of TTS Style Transfer
sravanidn/WaveRNN
WaveRNN Vocoder + TTS