shivammehta25

PhD Student at KTH Royal Institute of Technology

@facebookMenlo Park, CA

Pinned Repositories

TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python38.8k 309 1.2k4.9k
BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
Language:Jupyter Notebook26 2 02
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook4 1 01
Diff-TTSG
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Language:Python39 4 02
GitSetGo
Command Line Git Made Easy No Additional Dependencies Just Run the Script
Language:Python3 2 00
Match-TTSG
Language:Jupyter Notebook5 2 01
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language:Jupyter Notebook934 16 94122
Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
Language:Jupyter Notebook158 6 1425
OverFlow
Putting flows on top of neural transducers for better TTS
Language:Jupyter Notebook62 6 211
ScanX
This tool used nmap and scanpbnj modules to develop a mini shodan type engine that can search according to any service running on the vairous hosts, It connects the nmap results to the database providing a proper frontend with an administrative panel.
Language:PHP4 2 03

shivammehta25's Repositories

shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language:Jupyter Notebook934 16 94122
shivammehta25/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
Language:Jupyter Notebook158 6 1425
shivammehta25/OverFlow
Putting flows on top of neural transducers for better TTS
Language:Jupyter Notebook62 6 211
shivammehta25/Diff-TTSG
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
Language:Python39 4 02
shivammehta25/BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
Language:Jupyter Notebook26 2 02
shivammehta25/Match-TTSG
Language:Jupyter Notebook5 2 01
shivammehta25/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook4 1 01
shivammehta25/Matcha-TTS-checkpoints
Repository specific for hosting Matcha-TTS's checkpoints in its release. Mitigation due to the bug in gdown
3 2 00
shivammehta25/lightning-tutorials
Collection of Pytorch lightning tutorial form as rich scripts automatically transformed to ipython notebooks.
Language:Python1 1 0
shivammehta25/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Language:Python1 0 0
shivammehta25/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python0 1 00
shivammehta25/analysis-utilities
Language:Jupyter Notebook2 0
shivammehta25/Bayesian-Flow-Networks
A simple implimentation of Bayesian Flow Networks (BFN)
Language:Jupyter Notebook1 0
shivammehta25/CLAP
Contrastive Language-Audio Pretraining
Language:Python1 0
shivammehta25/conditional-flow-matching
Conditional Flow Matching: Simulation-Free Dynamic Optimal Transport
Language:Python1 0
shivammehta25/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Language:Python1 0
shivammehta25/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook1 0
shivammehta25/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python1 0
shivammehta25/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python1 0
shivammehta25/Fun-Coding
I will be saving and committing everyday, Something or update Study progress or Notes.
Language:Python2 01
shivammehta25/Grad-TTS_Repo
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Language:Jupyter Notebook1 0
shivammehta25/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook1 0
shivammehta25/NeMo
NeMo: a toolkit for conversational AI
Language:Python1 0
shivammehta25/Nvidia-DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Language:Jupyter Notebook1 0
shivammehta25/open-tts-tracker
1 0
shivammehta25/rustlings
:crab: Small exercises to get you used to reading and writing Rust code!
Language:Rust1 0
shivammehta25/shivammehta25
2 01
shivammehta25/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python1 0
shivammehta25/wasp_SE_course
Resources and student assignments for the WASP Software Engineering course
Language:TeX1 0
shivammehta25/WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
Language:Python1 0