SiddGururani

NVIDIA Research, Santa Clara, CA

SiddGururani's Stars

speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9.1k 134 1.1k1.4k
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Language:Python8.5k 155 5431.1k
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Language:Python8.1k 98 1.7k998
cs-books/influential-cs-books
Most influential books on Computer Science/programming
5.7k 174 14503
lucidrains/stylegan2-pytorch
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
Language:Python3.7k 69 263591
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.3k 52 222424
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language:Python1.9k 21 183192
sdatkinson/neural-amp-modeler
Neural network emulator for guitar amplifiers.
Language:Python1.9k 57 319155
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1.9k 28 220543
yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Language:Jupyter Notebook1.8k 16 63324
csteinmetz1/auraloss
Collection of audio-focused loss functions in PyTorch
Language:Python754 18 3767
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Language:Python669 19 73151
NVIDIA/NVFlare
NVIDIA Federated Learning Application Runtime Environment
Language:Python659 21 310182
facebookresearch/WavAugment
A library for speech data augmentation in time-domain
Language:Python652 25 1758
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Language:Python580 27 6788
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Language:Jupyter Notebook566 23 31122
adefossez/julius
Fast PyTorch based DSP for audio and 1D signals
Language:Python431 9 1124
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
Language:Python331 13 16612
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Language:Python266 11 946
facebookresearch/diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.
Language:Python235 10 815
ben-hayes/neural-waveshaping-synthesis
efficient neural audio synthesis in the waveform domain
Language:Python185 2 814
mbinkowski/DeepSpeechDistances
Authors' implementation of DeepSpeech Distances.
Language:Jupyter Notebook129 7 412
DolbyLaboratories/neural-upsampling-artifacts-audio
Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356
Language:Jupyter Notebook77 11 04
keonlee9420/Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Language:Python56 3 713
ajinkyakulkarni14/ERISHA
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer expressivity to a speaker's voice for which no expressive speech corpus is available.
Language:Python43 6 318
prosodylab/prosobeast-annotation-tool
Language:Python40 9 32
jckane/Voice_Analysis_Toolkit
A set of Matlab code for carrying out glottal source and voice quality analysis
Language:MATLAB31 7 113
tunib-ai/transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
Language:Python31 0 02
CookiePPP/VocoderComparisons
Train/test a variety of open source vocoders using the same input features and dataset. Then infer together for easy side-by-side comparisons.
Language:Python6 4 21
mohitzsh/ml-notes
Collection of notes I take while studying machine learning
Language:TeX1 1 00
