shekofteh
Yasser Shekofteh is an Assistant Professor at the Faculty of Computer Science and Engineering of Shahid Beheshti University (SBU).
Tehran, Iran
Pinned Repositories
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
asr_assignment
Code for the first assignment of the ASR course for 2020
Bachelors-Project-Allosaurus
Extra files used for the bachelor's project
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
denoising-wavenet-small
E2PCast
E2PCast: An English to Persian Voice Casting Dataset
IIRI-Net
The code of the paper: "IIRI-Net: An interpretable convolutional front-end inspired by IIR filters for speaker identification".
InterpretableCNN
An extended version of SincNet in which some general auditory filter models are added for the Speaker Identification task
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
MineSweeper-Matlab
Matlab Project 99
shekofteh's Repositories
shekofteh/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
shekofteh/asr_assignment
Code for the first assignment of the ASR course for 2020
shekofteh/Bachelors-Project-Allosaurus
Extra files used for the bachelor's project
shekofteh/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
shekofteh/denoising-wavenet-small
shekofteh/E2PCast
E2PCast: An English to Persian Voice Casting Dataset
shekofteh/IIRI-Net
The code of the paper: "IIRI-Net: An interpretable convolutional front-end inspired by IIR filters for speaker identification".
shekofteh/InterpretableCNN
An extended version of SincNet in which some general auditory filter models are added for the Speaker Identification task
shekofteh/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
shekofteh/MineSweeper-Matlab
Matlab Project 99
shekofteh/PAVID-CVs
Persian Audio-Visual Database
shekofteh/SampleDataWakeWordDetection
shekofteh/SGR_AFM
The code of the paper: "Exploiting auditory filter models as interpretable convolutional frontends to obtain optimal architectures for speaker gender recognition".
shekofteh/ShEMO-Modification
A modification of the ShEMO database
shekofteh/speech-data-gatherer-mobile
shekofteh/Spoken-Language-Identification
shekofteh/Voice-Pathology-Detection-by-Vowel-a-
In this repository, we use two different features and a CNN model to classify healthy and unhealthy samples from recordings of the vowel /a/
shekofteh/Audio-Classification
Code for YouTube series: Deep Learning for Audio Classification
shekofteh/AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"
shekofteh/Classification-of-Heart-Sound-Signal-Using-Multiple-Features-
Data plus code for Classification of Heart Sound Signal Using Multiple Features
shekofteh/E2PCast-Final
A Dataset for English to Persian Voice Casting
shekofteh/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
shekofteh/itsp
Introduction to Speech Processing
shekofteh/math-tools-nyu
DS-GA 1013 Mathematical Tools for Data Science
shekofteh/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
shekofteh/nn-zero-to-hero
Neural Networks: Zero to Hero
shekofteh/opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
shekofteh/parstwiner
Named Entity Recognition (NER) on the Persian Twitter dataset.
shekofteh/SpeechTransProgress
Tracking the progress in end-to-end speech translation
shekofteh/x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch