shivammehta25

PhD Student at KTH Royal Institute of Technology

@KTHStockholm, Sweden

shivammehta25's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language:C++60.9k 516 3.3k8.7k
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook33.6k 350 623.5k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python29.7k 424 4.2k6.3k
maybe-finance/maybe
The OS for your personal finances
Language:Ruby28.3k 147 2732.2k
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python23.1k 250 2742.6k
state-spaces/mamba
Mamba SSM architecture
Language:Python11.4k 97 379924
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language:Jupyter Notebook10.5k 139 3281k
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook8.3k 93 363696
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python4.2k 40 164388
LaurentMazare/tch-rs
Rust bindings for the C++ api of PyTorch.
Language:Rust4k 51 549317
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python3.4k 78 115611
cpeditor/cpeditor
The IDE for competitive programming :tada: | Fetch, Code, Compile, Run, Check, Submit :rocket:
Language:C++1.7k 24 443128
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1.7k 27 211513
collabora/WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
Language:Python1.4k 16 3098
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Language:Python1.1k 58 45134
Vaibhavs10/open-tts-tracker
1.1k 60 1566
jmtomczak/intro_dgm
"Deep Generative Modeling": Introductory Examples
Language:Jupyter Notebook935 26 5156
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
Language:Python864 14 3857
lmnt-com/diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Language:Python730 21 47110
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
Language:Python538 10 1722
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language:Python340 16 1538
allenai/OLMo-Eval
Evaluation suite for LLMs
Language:Python275 6 429
X-LANCE/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
Language:Python254 16 1320
nnaisense/bayesian-flow-networks
This is the official code release for Bayesian Flow Networks.
Language:Python215 12 422
voidful/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
Language:Python181 12 1620
DavidMChan/Anim400K
Anim-400K: A dataset designed from the ground up for automated dubbing of video
93 7 01
probabilisticai/probai-2023
Materials of the Nordic Probabilistic AI School 2023.
Language:Jupyter Notebook87 6 017
Takaaki-Saeki/DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
Language:Python74 4 25
p0p4k/Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
Language:Jupyter Notebook57 10 35
girisiman/girisiman.github.io
Language:HTML10

shivammehta25

shivammehta25's Stars

ggerganov/llama.cpp

mlabonne/llm-course

facebookresearch/fairseq

maybe-finance/maybe

Stability-AI/generative-models

state-spaces/mamba

facebookresearch/seamless_communication

facebookresearch/dinov2

allenai/OLMo

LaurentMazare/tch-rs

metavoiceio/metavoice-src

cpeditor/cpeditor

ming024/FastSpeech2

collabora/WhisperFusion

sh-lee-prml/HierSpeechpp

Vaibhavs10/open-tts-tracker

jmtomczak/intro_dgm

atong01/conditional-flow-matching

lmnt-com/diffwave

willisma/SiT

facebookresearch/audioseal

allenai/OLMo-Eval

X-LANCE/VoiceFlow-TTS

nnaisense/bayesian-flow-networks

voidful/Codec-SUPERB

DavidMChan/Anim400K

probabilisticai/probai-2023

Takaaki-Saeki/DiscreteSpeechMetrics

p0p4k/Matcha-TTS-2

girisiman/girisiman.github.io