bhairavmehta95
on leave from my @mit EECS PhD, building B2B software. previously MS @mila-iqia. BS @umich, @nasa-jpl, @NVlabs.
@ :) New York, NY
bhairavmehta95's Stars
ollama/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
reworkd/AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
udlbook/udlbook
Understanding Deep Learning - Simon J.D. Prince
probml/pml-book
"Probabilistic Machine Learning" - a book series by Kevin Murphy
dhowe/AdNauseam
AdNauseam: Fight back against advertising surveillance
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
innovatorved/whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
ericyangyu/PPO-for-Beginners
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
PABannier/bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech
facebookresearch/NeuralCompression
A collection of tools for neural compression enthusiasts.
daniilrobnikov/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
cccntu/minLoRA
minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.
MasayaKawamura/MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
Alkaar/resy-booking-bot
🔫 Helps to snipe hard to get reservations at restaurants that use resy
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
hayeong0/Diff-HierVC
Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"
sp-uhh/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
imdanboy/jets
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
fakufaku/diffusion-separation
Single channel speech source separation by diffusion process (ICASSP 2023)
rishikksh20/LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
wonjune-kang/lvc-vc
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions
rishikksh20/iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
yl4579/SLMGAN
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
vtuber-plan/iSTFTNet
iSTFTNet Vocoder PyTorch Implement
WelkinYang/multi-gradspeech