iamunr4v31
AI researcher | AI4Bharat | Working in Speech and Language | Interested in Multi-modal AI research.
AI4Bharat, Chennai
Pinned Repositories
about
A simple portfolio Jekyll theme
astravani
Weapons to wield sound
CS6910-Assignment2
CS6910-Assignments1
Assignments and Projects for CS6910 by Prof. Mitesh Khapra
DeepRL
Various algorithms implemented with CLI for easier training and testing purposes
idle_time
A module that reports the computer's idle time. Note: on Linux it requires xprintidle; run "sudo apt install xprintidle" for the module to work.
NewsCluster
Scrape and cluster news based on the headlines
Roar
Roar - a toolkit for Indic Speech AI
iamunr4v31's Repositories
iamunr4v31/astravani
Weapons to wield sound
iamunr4v31/CS6910-Assignment2
iamunr4v31/CS6910-Assignment3
Part 3 of CS6910 Assignments
iamunr4v31/CS6910-Assignments1
Assignments and Projects for CS6910 by Prof. Mitesh Khapra
iamunr4v31/Roar
Roar - a toolkit for Indic Speech AI
iamunr4v31/advice
A repository of links with advice related to grad school applications, research, PhD, etc.
iamunr4v31/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
iamunr4v31/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
iamunr4v31/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
iamunr4v31/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
iamunr4v31/deep-learning-v2-pytorch
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101
iamunr4v31/Diff-HierVC
Official PyTorch implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"
iamunr4v31/dl-fundamentals
Deep Learning Fundamentals -- Code material and exercises
iamunr4v31/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
iamunr4v31/google-research
Google Research
iamunr4v31/HoMM
High order Moment Models
iamunr4v31/iamunr4v31
iamunr4v31/iamunr4v31.github.io
Personal website using ai-folio
iamunr4v31/lightning-hydra-template
iamunr4v31/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
iamunr4v31/NeMo
NeMo: a toolkit for conversational AI
iamunr4v31/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
iamunr4v31/penn
Pitch Estimating Neural Networks (PENN)
iamunr4v31/pyxis
Container plugin for Slurm Workload Manager
iamunr4v31/StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
iamunr4v31/Streamlit_template
Streamlit template for hosting audio samples
iamunr4v31/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
iamunr4v31/TTS-data-pipeline
WIP: A TTS data scraping pipeline for YouTube with music separation, denoising, and auto-transcription
iamunr4v31/VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
iamunr4v31/x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers