brijmohan's Stars
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
facebookresearch/nevergrad
A Python toolbox for performing gradient-free optimization
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
tomgoldstein/loss-landscape
Code for visualizing the loss landscape of neural nets
JavierAntoran/Bayesian-Neural-Networks
Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace, SG-HMC and more
pytorch/opacus
Training PyTorch models with differential privacy
freewym/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
jjery2243542/adaptive_voice_conversion
StaticMania/portio-hugo
Portio Hugo is a simple, minimal and responsive Portfolio Hugo Theme. Portio Hugo is well organized, well-formatted, and named accordingly so it’s easy to change any and all of the design. Portio is built with Bootstrap 4. You can customize it very easily to fit your needs.
lfwa/carbontracker
Track and predict the energy consumption and carbon footprint of training deep learning models.
dilinwang820/Stein-Variational-Gradient-Descent
code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"
lochenchou/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
ksw0306/ClariNet
A Pytorch Implementation of ClariNet
jjery2243542/voice_conversion
f0nzie/tikz_favorites
collection of favorite TikZ graphics
sequitur-g2p/sequitur-g2p
This is a github repository of the abandonware Sequitur G2P by Bisani & Ney
cvqluu/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
LaRiffle/collateral-learning
Collateral Learning - Functional Encryption and Adversarial Training on partially encrypted networks
idiap/pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
higuma/wav-audio-encoder-js
Waveform Audio encoder for browsers
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
vvestman/pytorch-ivectors
GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An example script is provided for VoxCeleb data.
Voice-Privacy-Challenge/Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf
kan-bayashi/INTERSPEECH19_TUTORIAL
Interspeech 2019 tutorial materials
microsoft/INMT-lite
Interactive Neural Machine Translation-lite (INMT-lite) is a framework to train and develop lite versions (.tflite) of models for neural machine translation (NMT) that can be run on embedded devices like mobile phones and tablets that have low computation power and space. The tflite models generated can be used to build the offline version of INMT mobile, a mobile version of INMT web.
deep-privacy/SA-toolkit
SA-toolkit: Speaker speech anonymization toolkit in python
CRIStAL-Sigma/phd-thesis-template
Template for PhD thesis using Tufte's style book
deep-privacy/espnet
End-to-End Speech Processing Toolkit