SiddGururani

NVIDIA ResearchSanta Clara, CA

SiddGururani's Stars

eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
27.2k 948 243.7k
adamian98/pulse
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models
Language:Python7.9k 227 851.5k
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python1.9k 31 162504
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Language:Jupyter Notebook1.5k 45 255340
belangeo/pyo
Python DSP module
Language:Python1.3k 66 238131
philipperemy/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
Language:Python901 49 85240
aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Language:Python894 23 33153
NVIDIA/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Language:Jupyter Notebook854 30 95184
spotify/klio
Smarter data pipelines for audio.
Language:Python835 20 648
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Language:Python826 31 79157
auspicious3000/SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
Language:Python637 23 7192
kamenbliznashki/normalizing_flows
Pytorch implementations of density estimation algorithms: BNAF, Glow, MAF, RealNVP, planar flows
Language:Python600 16 15101
liusongxiang/StarGAN-Voice-Conversion
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
Language:Python511 21 2193
rosinality/glow-pytorch
PyTorch implementation of Glow
Language:Python508 9 4297
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language:Jupyter Notebook402 17 2655
pranaymanocha/PerceptualAudio
Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM
Language:Python352 10 1933
vBaiCai/python-pesq
A python package for calculating the PESQ.
Language:Python352 11 2569
GuitarML/SmartGuitarPedal
Guitar plugin made with JUCE that uses neural network models to emulate real world hardware.
Language:C++263 14 1524
huyanxin/phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
Language:Python221 9 1350
craigmacartney/Wave-U-Net-For-Speech-Enhancement
Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemented for the task of speech enhancement in the time-domain.
Language:Python212 8 1039
yistLin/FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
Language:Python197 13 2738
asuni/wavelet_prosody_toolkit
Language:Python180 5 1841
acids-ircam/flow_synthesizer
Universal audio synthesizer control learning with normalizing flows
Language:Max132 12 422
zomux/lanmt
LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference
Language:Python79 5 94
L0SG/NanoFlow
PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
Language:Python64 3 04
adrienchaton/PerceptualAudio_Pytorch
Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - unofficial work in progress
Language:Python60 1 32
russellgeum/Phase-aware-Deep-Complex-UNet
[Not Official] Implementation DC-UNet, ICLR 2019
Language:Python55 5 218
thuhcsi/icassp2021-emotion-tts
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
Language:Python33 2 513
ViEm-ccy/GEDLoss_pytorch
a pytorch implementation of Google GEDLoss
Language:Python32 2 12
hifi-gan/code01
Language:Python15 2 05

SiddGururani

SiddGururani's Stars

eugeneyan/applied-ml

adamian98/pulse

jik876/hifi-gan

kan-bayashi/ParallelWaveGAN

belangeo/pyo

philipperemy/deep-speaker

aliutkus/speechmetrics

NVIDIA/mellotron

spotify/klio

Tomiinek/Multilingual_Text_to_Speech

auspicious3000/SpeechSplit

kamenbliznashki/normalizing_flows

liusongxiang/StarGAN-Voice-Conversion

rosinality/glow-pytorch

ivanvovk/WaveGrad

pranaymanocha/PerceptualAudio

vBaiCai/python-pesq

GuitarML/SmartGuitarPedal

huyanxin/phasen

craigmacartney/Wave-U-Net-For-Speech-Enhancement

yistLin/FragmentVC

asuni/wavelet_prosody_toolkit

acids-ircam/flow_synthesizer

zomux/lanmt

L0SG/NanoFlow

adrienchaton/PerceptualAudio_Pytorch

russellgeum/Phase-aware-Deep-Complex-UNet

thuhcsi/icassp2021-emotion-tts

ViEm-ccy/GEDLoss_pytorch

hifi-gan/code01