SiddGururani's Stars
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
cs-books/influential-cs-books
Most influential books on Computer Science/programming
lucidrains/stylegan2-pytorch
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
sdatkinson/neural-amp-modeler
Neural network emulator for guitar amplifiers.
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
yang-song/score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
csteinmetz1/auraloss
Collection of audio-focused loss functions in PyTorch
jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
NVIDIA/NVFlare
NVIDIA Federated Learning Application Runtime Environment
facebookresearch/WavAugment
A library for speech data augmentation in time-domain
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
adefossez/julius
Fast PyTorch based DSP for audio and 1D signals
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
facebookresearch/diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.
ben-hayes/neural-waveshaping-synthesis
efficient neural audio synthesis in the waveform domain
mbinkowski/DeepSpeechDistances
Authors' implementation of DeepSpeech Distances.
DolbyLaboratories/neural-upsampling-artifacts-audio
Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356
keonlee9420/Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
ajinkyakulkarni14/ERISHA
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.
prosodylab/prosobeast-annotation-tool
jckane/Voice_Analysis_Toolkit
A set of Matlab code for carrying out glottal source and voice quality analysis
tunib-ai/transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
CookiePPP/VocoderComparisons
Train/test a variety of open source vocoders using the same input features and dataset. Then infer together for easy side-by-side comparisons.
mohitzsh/ml-notes
Collection of notes I take while studying machine learning