Pinned Repositories
awesome-nlp
A curated list of speech and natural language processing resources
delimitGesture_xrmb
Code to delimite articulatory gestures on Tract Variables from XRMB dataset
keras-bucketed-sequence
Keras dataset reader (Sequence) using buckets for RNNs
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
si_vtln_spkAdapt
Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion
speech-inversion-dnn
Acoustic to Articulatory speech inversion with a Feedforward DNN
speech-inversion-matlabNN
Code to estimate Vocal Tract constriction Variables from speech using a shallow Neural Network trained in Matlab
speech_inversion_rt
Real time version of a speech inversion system
speechfxt
Feature extraction pipeline for various commonly used speech features
ganesa90's Repositories
ganesa90/si_vtln_spkAdapt
Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion
ganesa90/speechfxt
Feature extraction pipeline for various commonly used speech features
ganesa90/awesome-nlp
A curated list of speech and natural language processing resources
ganesa90/delimitGesture_xrmb
Code to delimite articulatory gestures on Tract Variables from XRMB dataset
ganesa90/keras-bucketed-sequence
Keras dataset reader (Sequence) using buckets for RNNs
ganesa90/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
ganesa90/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
ganesa90/speech-inversion-dnn
Acoustic to Articulatory speech inversion with a Feedforward DNN
ganesa90/speech-inversion-matlabNN
Code to estimate Vocal Tract constriction Variables from speech using a shallow Neural Network trained in Matlab
ganesa90/speech_inversion_rt
Real time version of a speech inversion system
ganesa90/stock2music
This Matlab project is to help blind people visualize stock data
ganesa90/vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
ganesa90/vowel_classifier
Classifying vowel segments