markostam's Stars
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Maratyszcza/PeachPy
x86-64 assembler embedded in Python
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
mmorise/World
A high-quality speech analysis, manipulation and synthesis system
aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
ChihebTrabelsi/deep_complex_networks
Implementation related to the Deep Complex Networks
kaituoxu/Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
breizhn/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
sevagh/pitch-detection
autocorrelation-based O(NlogN) pitch detection
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
BenWhetton/keras-surgeon
Pruning and other network surgery for trained Keras models.
etzinis/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
espnet/interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
sashkab/homebrew-python
Homebrew tap for Python versions.
zhr1201/CNN-for-single-channel-speech-enhancement
Convolutional neural nets for single channel speech enhancement
thevasudevgupta/gsoc-wav2vec2
GSoC'2021 | TensorFlow implementation of Wav2Vec2
jatinchowdhury18/audio_dspy
A Python package for audio signal processing tools
opsxcq/meme-vibing-cat
Vibing Cat meme generator
IMLHF/Speech-Enhancement-Measures
speech enhancement metrics:CSIG, CBAK, CMOS, SSNR, PESQ, STOI, ESTOI, SNR, IS, LLR, WSS
jmcasebeer/autodsp
Train custom adaptive filter optimizers without hand tuning or extra labels.
ltfat/phaseret
Phase ReTrieval for time-frequency representations
sevagh/audio-degradation-toolbox
easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox
BrechtDeMan/loudness.py
EBU R128 / ITU-R BS.1770 integrated loudness measurement in Python
lonce/SPSI_Python
Single Pass Spectrogram Inversion in a Jupyter Python notebook
funcwj/chime4-nn-mask
Implementation of NN based mask estimator in pytorch
sanowar-raihan/fourier-feature-superresolution
Fourier Features for Image, Audio, and Video Super-Resolution
Mak-Sim/IRAPT
Instantaneous pitch estimation based on RAPT framework (EUSIPCO-2012)
modulate-ai/wav_logger
Real-Time Audio Logging in C++
Jonathan-LeRoux/NBAcomebacks
Analysis of the largest comebacks in the NBA