raikarsagar

ML Architect | Conversational AI | NLP | Voice AI

Jivi ai | Bangalore, India

Pinned Repositories

academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript
agri_crop_prediction
Language:Python
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Language:Python
Audio-and-text-based-emotion-recognition
A multimodal approach on emotion recognition using audio and text.
Language:Jupyter Notebook
awesome-kaldi
This is a list of features, scripts, blogs and resources for making better use of Kaldi (http://kaldi-asr.org/)
cuddly-garbanzo
Language:Dockerfile
deepspeech-ASR
Mozilla DeepSpeech automatic speech recognition system
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python
fastspeech2_custom
image-background-changer
Change the background of an image using semantic segmentation
Language:Python

raikarsagar's Repositories

raikarsagar/image-background-changer
Change the background of an image using semantic segmentation
Language:Python
raikarsagar/vocode-react-sdk
Language:TypeScript
raikarsagar/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript
raikarsagar/agri_crop_prediction
Language:Python
raikarsagar/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Language:Python
raikarsagar/Audio-and-text-based-emotion-recognition
A multimodal approach on emotion recognition using audio and text.
Language:Jupyter Notebook
raikarsagar/awesome-kaldi
This is a list of features, scripts, blogs and resources for making better use of Kaldi (http://kaldi-asr.org/)
raikarsagar/cuddly-garbanzo
Language:Dockerfile
raikarsagar/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python
raikarsagar/fastspeech2_custom
raikarsagar/flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
Language:Jupyter Notebook
raikarsagar/From-0-to-Research-Scientist-resources-guide
Detailed and tailored guide for undergraduate students or anybody who wants to dig deep into the field of AI with a solid foundation.
raikarsagar/FullSubNet
PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
raikarsagar/GradTTS
PyTorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"
raikarsagar/ml-with-audio
HF's ML for Audio study group
raikarsagar/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
raikarsagar/multimodal-speech-emotion-recognition
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
raikarsagar/NeMo
NeMo: a toolkit for conversational AI
raikarsagar/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
raikarsagar/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
raikarsagar/raikarsagar.github.io
Website
Language:SCSS
raikarsagar/speechbrain
A PyTorch-based Speech Toolkit
raikarsagar/SpeechDenoisingWithDeepFeatureLosses
Speech Denoising with Deep Feature Losses
raikarsagar/spyder
Simple Python package for fast DER computation
raikarsagar/svrinfo_proj
raikarsagar/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook
raikarsagar/tacotron2_inference
NVIDIA tacotron2 repo with custom inference scripts
Language:Jupyter Notebook
raikarsagar/tacotron2_waveglow
raikarsagar/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E, WIP
Language:Python
raikarsagar/waveglow
A Flow-based Generative Network for Speech Synthesis
Language:Python