Speech, Language, and Deep Learning Lab

The source code of some of the projects conducted in Joseph Keshet's Research Lab.

Pinned Repositories

AutoAligner
Trainable algorithm for accurate force alignment
Language:Rust5 3 00
AutoPhonemeClassifier
Multiclass Phoneme Classifier trained at the frame level
Language:C++6 3 41
AutoVowelDuration
Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files
Language:C++3 2 00
DeepFormants
Formant Tracking & Estimation
Language:Python73 6 917
DeepPhoneticToolsTutorial
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
Language:Python12 3 03
Dr.VOT
Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).
Language:Python26 6 59
FormantsTracker
Language:Python10 2 25
scaler_gan
Language:Python3 1 13
speech_yolo
SpeechYOLO Interspeech 2019
Language:Python41 9 612
ssl_diarization
Self-supervised Speaker Diarization Interspeech 2022 Implementation
Language:Python9 3 10

Speech, Language, and Deep Learning Lab's Repositories

MLSpeech/DeepFormants
Formant Tracking & Estimation
Language:Python73 6 917
MLSpeech/speech_yolo
SpeechYOLO Interspeech 2019
Language:Python41 9 612
MLSpeech/Dr.VOT
Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).
Language:Python26 6 59
MLSpeech/DeepPhoneticToolsTutorial
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
Language:Python12 3 03
MLSpeech/FormantsTracker
Language:Python10 2 25
MLSpeech/ssl_diarization
Self-supervised Speaker Diarization Interspeech 2022 Implementation
Language:Python9 3 10
MLSpeech/AutoPhonemeClassifier
Multiclass Phoneme Classifier trained at the frame level
Language:C++6 3 41
MLSpeech/AutoVowelDuration
Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files
Language:C++3 2 00
MLSpeech/scaler_gan
Language:Python3 1 13
MLSpeech/AutoPreaspiration
A software package for automatic extraction of pre-aspiration from speech segments in audio files, using a trainable algorithm.
Language:C++2 4 0
MLSpeech/GradSeg
Language:Python2 3 11
MLSpeech/AutoVOT
Trainable algorithm for automatic measurement of voice onset time
Language:C++1 3 04
MLSpeech/DDKtor
Language:Python1 2 01
MLSpeech/DeepVOT
Automatic Measurement of Voice Onset Time (VOT) using Deep Recurrent Neural Networks
Language:Python1 2 01
MLSpeech/DeepWDM
Recurrent Neural Networks for Word Duration Measurement
Language:Python1 2 01
MLSpeech/distributed_random_features_svm
a distributed implementation of SVM using random features
Language:C++1 2 0
MLSpeech/DSegKNN
Language:Python1 2 01
MLSpeech/WatermarkNN
Watermarking Deep Neural Networks (USENIX 2018)
Language:Python1 2 0
MLSpeech/.github
0 1 00
MLSpeech/FixedClassificationLayer
Language:Python0 3 00
MLSpeech/MLSpeech.github.io
Machine learning-based tools for fine grained phonetic measurements
Language:HTML0 2 00
MLSpeech/semantic_OOD
0 3 40
MLSpeech/CRP
Training with constant perturbations against adversarial attacks.
Language:Python2 0
MLSpeech/DIFFAR
Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation
Language:Python0 0
MLSpeech/Image-Captioning
This project implements the paper: "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
Language:Python2 0
MLSpeech/percept_sim
Language:Jupyter Notebook
MLSpeech/PiMOD
Pitch Estimation by Multiple Octave Decoders
Language:Python2 01
MLSpeech/WatermarkVerification
MLSpeech/Whisper_denoiser
2 0
MLSpeech/WhisperDenoiser
2 0