sravanidn

I am an ML researcher and engineer at Comcast Labs with a focus on NLP, speech processing, and human-machine interfaces.

ML Researcher at Comcast Labs
San Francisco, California

Pinned Repositories

audacity
Audio Editor
Language: C
0 1 00
awesome-object-detection
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
0 1 00
coursera-gan-specialization
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
Language: Jupyter Notebook
0 1 00
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language: C++
0 1 00
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Language: Python
0 1 00
FARM
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Language: Python
1 1 00
LifeIns_Modling_Buy_NoBuy
Language: R
0 2 00
pytorch_geometric
Geometric Deep Learning Extension Library for PyTorch
Language: Python
1 1 00
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language: Python
1 1 00
Tacotron-2
DeepMind's Tacotron-2 TensorFlow implementation
Language: Python
0 1 00

sravanidn's Repositories

sravanidn/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
Language: Python
1 1 00
sravanidn/FARM
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Language: Python
1 1 00
sravanidn/pytorch_geometric
Geometric Deep Learning Extension Library for PyTorch
Language: Python
1 1 00
sravanidn/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language: Python
1 1 00
sravanidn/audacity
Audio Editor
Language: C
0 1 00
sravanidn/awesome-object-detection
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
0 1 00
sravanidn/coursera-gan-specialization
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
Language: Jupyter Notebook
0 1 00
sravanidn/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language: C++
0 1 00
sravanidn/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Language: Python
0 1 00
sravanidn/LifeIns_Modling_Buy_NoBuy
Language: R
0 2 00
sravanidn/Tacotron-2
DeepMind's Tacotron-2 TensorFlow implementation
Language: Python
0 1 00
sravanidn/awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
1 0
sravanidn/ba
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Language: Jupyter Notebook
0 0
sravanidn/computer-science
Path to a free self-taught education in Computer Science!
1 0
sravanidn/deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Language: Python
1 0
sravanidn/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language: Python
1 0
sravanidn/Fuji_Data
Language: Visual Basic
2 0
sravanidn/go-figure-kubernetes
Kubernetes environment for running go figure apps
Language: Shell
1 0
sravanidn/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python
1 0
sravanidn/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitch Fix, etc.
1 0
sravanidn/ml-system-design-pattern
System design patterns for machine learning
0 0
sravanidn/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Language: Python
1 0
sravanidn/overview
Description-FAQ of the process
1 0
sravanidn/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Language: Jupyter Notebook
1 0
sravanidn/pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian)
Language: Jupyter Notebook
1 0
sravanidn/stylegan2-training
Materials for StyleGAN2 Training class
Language: Jupyter Notebook
1 0
sravanidn/tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Language: Python
1 0
sravanidn/talking-jobs
1 0
sravanidn/TTS-Style-Transfer
Official PyTorch implementation of TTS Style Transfer
Language: Python
1 0
sravanidn/WaveRNN
WaveRNN Vocoder + TTS
Language: Python
1 0