Pinned Repositories
Decision-level-data-fusion-of-Image-and-Speech-recognition-system
“Detecting an object in an area based on the command given through speech using MFCC and HMM for Speech \& HOG and SVM for Image as Features and classifier respectively”
espnet
End-to-End Speech Processing Toolkit
IPL_Analysis
MWSG_IEEE_Paper
Codes for paper "Spectrogram enhancement using multiple window Savitzky Golay (MWSG) filter for robust bird sound detection" which is published in IEEE Transactions on Speech,Audio and Language Processing March 2017
mydotfiles
pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
siamese-NLI
Experimenting Siamese Networks for Natural Language Inference
Tensorflow_Keras_deepLearningCodes
VisualQuestion_VQA
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
nithinraok's Repositories
nithinraok/VisualQuestion_VQA
nithinraok/espnet
End-to-End Speech Processing Toolkit
nithinraok/IPL_Analysis
nithinraok/mydotfiles
nithinraok/NeMo
NeMo: a toolkit for conversational AI
nithinraok/pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
nithinraok/siamese-NLI
Experimenting Siamese Networks for Natural Language Inference
nithinraok/Tensorflow_Keras_deepLearningCodes
nithinraok/DL_Course
nithinraok/AnatomyOfMatplotlib
Anatomy of Matplotlib -- tutorial developed for the SciPy conference
nithinraok/cmusphinx.github.io
CMUSphinx Website
nithinraok/DataCarpentaryWorkshop
nithinraok/fashion-mnist
A MNIST-like fashion product database. Benchmark :point_right:
nithinraok/fast_align
Simple, fast unsupervised word aligner
nithinraok/find_phone_task_4
nithinraok/gecko
Gecko - A Tool for Effective Annotation of Human Conversations
nithinraok/kaldi
This is now the official location of the Kaldi project.
nithinraok/murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
nithinraok/nithinraok.github.io
my Website
nithinraok/open_asr_leaderboard
nithinraok/Processing
Games,Designs and Signal Processing stuff created using Processing
nithinraok/PythonPrograms
Programs I created using Python
nithinraok/pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
nithinraok/Pytorch_DeepLearning
nithinraok/resume
This is my resume.
nithinraok/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
nithinraok/speech
A PyTorch Implementation of End-to-End Models for Speech-to-Text
nithinraok/Speech_word_DNN
Using Deep Neural Networks to recognize spoken words
nithinraok/SpireLabDemo
nithinraok/TensorFlow2_0_Udacity