Mashiro009's Stars
nttcslab-sp/kaldiio
A pure Python module for reading and writing Kaldi ark files
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Andong-Li-speech/Neural-Vocoders-as-Speech-Enhancers
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
audiolabs/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
naba89/istft-torch
Quick and naive translation of torch.istft() to Python, with an option to skip the NOLA check.
YoonhyungLee94/SSFCVAE
Official PyTorch implementation of the paper "Boosting Speech Enhancement with Clean Self-Supervised Features via Conditional Variational Autoencoders"
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
vdumoulin/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
starrytong/SCNet
Emrys365/DNS_text
Transcripts of the DNS Challenge test sets
merlresearch/tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
fgnt/pb_bss
Collection of EM algorithms for blind source separation of audio signals
ashutosh620/DDAEC
aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
WenzheLiu-Speech/awesome-speech-enhancement
Speech enhancement, speech separation, sound source localization
csukuangfj/kaldi_native_io
Python wrapper for Kaldi's native I/O
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
akaashdash/xlstm
MrYxJ/calculate-flops.pytorch
calflops is designed to calculate FLOPs, MACs, and parameters for a wide range of neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models)
sovrasov/flops-counter.pytorch
FLOPs counter for convolutional networks in the PyTorch framework
RoyChao19477/PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
RoyChao19477/SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
ioyy900205/MFNet
This repo provides the processed samples for the manuscript "A Mask Free Neural Network for Monaural Speech Enhancement", which was accepted by INTERSPEECH 2023.
key2miao/TSTNN
Transformer-based neural network for speech enhancement in the time domain
dmlguq456/SepReformer
Official repository of SepReformer for speech separation
tomasJwYU/AutoPrepDemo
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
nickcercone/spectrogram
haoxiangsnr/SpEx
Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".
mborsdorf/UniversalSpeakerExtraction