spoken-language-processing
There are 19 repositories under the spoken-language-processing topic.
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
kahne/fastwer
A PyPI package for fast word/character error rate (WER/CER) calculation
ReneeYe/ConST
Code for the paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)
praaline/Praaline
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
Sherry-XLL/Digital-Recognition-DTW_HMM_GMM
Digit recognition system (10 digits) based on DTW, HMM and GMM
ReneeYe/XSTNet
This is an implementation of the paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech 2021)
navierula/mood-class
software that analyzes speech utterances
SushantKafle/speechtext-wimp-labeler
This project demonstrates the use of generic bi-directional LSTM models for predicting the importance of words in a spoken dialogue for understanding its meaning. The model is trained and evaluated on a human-annotated corpus of word importance. The corpus can be downloaded from: http://latlab.ist.rit.edu/lrec2018
tianleimin/Thesis-EmotionRecognition
Example code for my PhD work on recognizing dimensional emotions in spoken dialogue
Rawan-Kh/NLP_Datacamp
All NLP related courses on DataCamp
brijmohan/lid-convex-comb
Convex combination of phonotactics for large-scale spoken language identification
gchrupala/peppa
Code for the paper "Learning English with Peppa Pig" https://doi.org/10.48550/arXiv.2202.12917
loonghch/native-language-cnn
Speech subtask of the 2017 NLI Shared Task
el841/ruby
The Ruby Programming Language
malifalhakim/prompt-based-tts-indo
Prompt-based Text-to-Speech system using Parler TTS, designed for generating natural-sounding speech in Indonesian. Includes dataset preparation, model training, inference pipeline, and performance evaluation.
vocaliodmiku/SLI-LL
Repository of the paper: "Spoken Language Intelligence of Large Language Models for Language Learning"
wanghao15536870732/ChatWithEveryone
🚧 The Internet + project YiLuYuBan. The project is too messy and has moved to https://github.com/wanghao15536870732/ChatWithChinese
samKenpachi011/Spoken-Language-Processing
A guide to spoken language processing
vunb/is13
RNN for Spoken Language Understanding