mirishkarganesh
PhD scholar at IIIT Hyderabad, India. Working on Automatic Speech Recognition and Speech Enhancement
IIIT HyderabadHyderabad
Pinned Repositories
k0
espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
flashlight
A C++ standalone library for machine learning
k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
kaldi
This is the official location of the Kaldi project.
speechbrain
A PyTorch-based Speech Toolkit
ASR
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
eesen
The official repository of the Eesen project
End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
mirishkarganesh's Repositories
mirishkarganesh/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
mirishkarganesh/IndicTrans2
Translation models for 22 scheduled languages of India
mirishkarganesh/IndicBERT
Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME
mirishkarganesh/ganesh_w2v
mirishkarganesh/k0
mirishkarganesh/mirishkarganesh
Config files for my GitHub profile.
mirishkarganesh/intermdiate_layer_matter_ssl
The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.
mirishkarganesh/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
mirishkarganesh/ULCA-asr-dataset-corpus
mirishkarganesh/icon_submission
Multilingual
mirishkarganesh/indic-punct
mirishkarganesh/mirishkarganesh.github.io
this is my website
mirishkarganesh/vakyansh-wav2vec2-experimentation
mirishkarganesh/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
mirishkarganesh/speechbrain
A PyTorch-based Speech Toolkit
mirishkarganesh/neural_sp
End-to-end ASR/LM implementation with PyTorch
mirishkarganesh/flashlight
A C++ standalone library for machine learning
mirishkarganesh/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
mirishkarganesh/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
mirishkarganesh/RIR-Generator
Generating room impulse responses
mirishkarganesh/inference
Reference implementations of inference benchmarks
mirishkarganesh/lhotse
mirishkarganesh/k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
mirishkarganesh/nccl
Optimized primitives for collective multi-GPU communication
mirishkarganesh/pytorch_complex
A temporal module for PyTorch-ComplexTensor
mirishkarganesh/espnet
End-to-End Speech Processing Toolkit
mirishkarganesh/kaldi
This is the official location of the Kaldi project.
mirishkarganesh/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
mirishkarganesh/kaldi-onnx
Kaldi model converter to ONNX
mirishkarganesh/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.