Pinned Repositories
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
Diff-TTSG
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
GitSetGo
Command-line Git made easy: no additional dependencies, just run the script
Match-TTSG
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Matcha-TTS-checkpoints
Repository specifically for hosting Matcha-TTS's checkpoints in its release, as a workaround for a bug in gdown
Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
OverFlow
Putting flows on top of neural transducers for better TTS
ScanX
This tool uses the nmap and scanpbnj modules to build a mini Shodan-type engine that can search by any service running on the various hosts. It connects the nmap results to a database and provides a proper frontend with an administrative panel.
shivammehta25's Repositories
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
shivammehta25/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
shivammehta25/OverFlow
Putting flows on top of neural transducers for better TTS
shivammehta25/Diff-TTSG
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis
shivammehta25/BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
shivammehta25/Match-TTSG
shivammehta25/Matcha-TTS-checkpoints
Repository specifically for hosting Matcha-TTS's checkpoints in its release, as a workaround for a bug in gdown
shivammehta25/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
shivammehta25/lightning-tutorials
Collection of PyTorch Lightning tutorials in the form of rich scripts, automatically transformed into IPython notebooks.
shivammehta25/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
shivammehta25/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
shivammehta25/analysis-utilities
shivammehta25/Bayesian-Flow-Networks
A simple implementation of Bayesian Flow Networks (BFN)
shivammehta25/CLAP
Contrastive Language-Audio Pretraining
shivammehta25/conditional-flow-matching
Conditional Flow Matching: Simulation-Free Dynamic Optimal Transport
shivammehta25/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
shivammehta25/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
shivammehta25/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
shivammehta25/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
shivammehta25/Fun-Coding
I will be saving and committing every day: something new, or updates on study progress or notes.
shivammehta25/Grad-TTS_Repo
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
shivammehta25/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
shivammehta25/NeMo
NeMo: a toolkit for conversational AI
shivammehta25/Nvidia-DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
shivammehta25/open-tts-tracker
shivammehta25/rustlings
🦀 Small exercises to get you used to reading and writing Rust code!
shivammehta25/shivammehta25
shivammehta25/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
shivammehta25/wasp_SE_course
Resources and student assignments for the WASP Software Engineering course
shivammehta25/WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.