shivammehta25's Stars
abertsch72/unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
calcom/cal.com
Scheduling infrastructure for absolutely everyone.
google/visqol
Perceptual Quality Estimator for speech and audio
HarisIqbal88/PlotNeuralNet
LaTeX code for drawing neural network diagrams
kkroening/ffmpeg-python
Python bindings for FFmpeg - with complex filtering support
declare-lab/tango
A family of diffusion models for text-to-audio generation.
lucidrains/naturalspeech2-pytorch
Implementation of NaturalSpeech 2, a zero-shot speech and singing synthesizer, in PyTorch
Stability-AI/StableLM
StableLM: Stability AI Language Models
muramasa2/paper_summary
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
TobiasNorlund/retro
Official repo for "On the Generalization Ability of Retrieval-Enhanced Transformers"
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
norvig/pytudes
Python programs, usually short, of considerable difficulty, to perfect particular skills.
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
coqui-ai/Trainer
🐸 - A general purpose model trainer, as flexible as it gets
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
karpathy/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
python-poetry/poetry
Python packaging and dependency management made easy
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in PyTorch
Zain-Jiang/Speech-Editing-Toolkit
A repository of implementations of neural speech editing algorithms.
biox/pa
A simple password manager. Encryption via age, written in portable POSIX shell.
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
shivammehta25/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
ashleve/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡