Pinned Repositories
MahdiEsrafili's Repositories
MahdiEsrafili/ConvLSTM_pytorch
Convolutional LSTM implemented in PyTorch
MahdiEsrafili/rhasspy-silence
Silence detection in audio stream using webrtcvad
MahdiEsrafili/2D-Keypoints-based-Pose-Classifier
MahdiEsrafili/3D-UNet-PyTorch-Implementation
The implementation of 3D-UNet using PyTorch
MahdiEsrafili/Backend-Internship
Tacademy backend (Java) documentation
MahdiEsrafili/codestar-internship
CodeStar internship documentation
MahdiEsrafili/commonvoice-th
Kaldi recipe to train commonvoice corpus in Thai language
MahdiEsrafili/ctcdecode
PyTorch CTC Decoder bindings
MahdiEsrafili/fa_kaldi-rhasspy
Persian Kaldi profile for Rhasspy built from open speech data
MahdiEsrafili/flashtext
Extract Keywords from sentence or Replace keywords in sentences.
MahdiEsrafili/github-issue-templates
A collection of GitHub issue and pull request templates
MahdiEsrafili/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
MahdiEsrafili/machine-learning-articles
🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.
MahdiEsrafili/MahdiEsrafili
Config files for my GitHub profile.
MahdiEsrafili/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
MahdiEsrafili/MoSQITo
MoSQITo is a unified and modular development framework of key sound quality metrics, favoring reproducible science and efficient shared scripting among the community of engineers, teachers, and researchers.
MahdiEsrafili/noisereduce
Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)
MahdiEsrafili/persian-stt
A Speech-To-Text Model Developed Using 🐸STT
MahdiEsrafili/Persian-Swear-Words
Persian swear-word dataset that you can use in production to filter unwanted content. A dataset of inappropriate and offensive Persian words for filtering text.
MahdiEsrafili/pert
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
MahdiEsrafili/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
MahdiEsrafili/Resemblyzer
A Python package to analyze and compare voices with deep learning
MahdiEsrafili/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
MahdiEsrafili/SNR-Estimation-Using-Deep-Learning
An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning
MahdiEsrafili/Tacotron-2-Persian
Tacotron 2 - Persian
MahdiEsrafili/telegram_channel_link_bot
A bot that serves as a link between a private channel and a public channel
MahdiEsrafili/VoiceSplit
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
MahdiEsrafili/vosk-build-model
How to create your own model for Vosk
MahdiEsrafili/wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
MahdiEsrafili/Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode