jnaranjo-sciling's Stars
google-research/google-research
Google Research
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
mozilla/DeepSpeech
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
espnet/espnet
End-to-End Speech Processing Toolkit
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
sepandhaghighi/pycm
Multi-class confusion matrix library in Python
YannickJadoul/Parselmouth
Praat in Python, the Pythonic way
astorfi/speechpy
SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
f90/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
amsehili/auditok
An audio/acoustic activity detection and audio segmentation tool
JeremyCCHsu/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
breizhn/DTLN
TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-lite, ONNX, and real-time audio processing support.
yongxuUSTC/sednn
Deep-learning-based speech enhancement using Keras or PyTorch, made easy to use.
haoxiangsnr/Wave-U-Net-for-Speech-Enhancement
Wave-U-Net implemented in PyTorch and adapted to speech enhancement.
seanwood/gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
espnet/espnet_model_zoo
ESPnet Model Zoo
nmilosev/pytorch-arm-builds
Unofficial PyTorch and torchvision builds for ARM devices
faroit/CountNet
Deep Neural Network for Speaker Count Estimation
Kashu7100/pytorch-armv7l
PyTorch 1.7.0 and torchvision 0.8.0 builds for Raspberry Pi 4 (32-bit OS)
aadeshnpn/OSDN
Keras implementation of the research paper "Towards Open Set Deep Networks" (A. Bendale, T. Boult, CVPR 2016)
maltequast/pytorch_arm_whl
Vauxoo/pstats-print2list
Filter, group, and sort cProfile pstats results
asteroid-team/asteroid_app
tr7200/Youdens_index
PyTorch and TF-Keras implementations of an epidemiology metric from Kaivanto (2008), suitable for imbalanced data