shivammehta25's Stars
ggerganov/llama.cpp
LLM inference in C/C++
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
maybe-finance/maybe
The OS for your personal finances
Stability-AI/generative-models
Generative Models by Stability AI
state-spaces/mamba
Mamba SSM architecture
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
LaurentMazare/tch-rs
Rust bindings for the C++ API of PyTorch.
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
cpeditor/cpeditor
The IDE for competitive programming :tada: | Fetch, Code, Compile, Run, Check, Submit :rocket:
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
collabora/WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Vaibhavs10/open-tts-tracker
jmtomczak/intro_dgm
"Deep Generative Modeling": Introductory Examples
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
allenai/OLMo-Eval
Evaluation suite for LLMs
X-LANCE/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
nnaisense/bayesian-flow-networks
This is the official code release for Bayesian Flow Networks.
voidful/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
DavidMChan/Anim400K
Anim-400K: A dataset designed from the ground up for automated dubbing of video
probabilisticai/probai-2023
Materials of the Nordic Probabilistic AI School 2023.
Takaaki-Saeki/DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
p0p4k/Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
girisiman/girisiman.github.io