Mashiro009

Mashiro009's Stars

nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files
Language:Python24935
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.1k540
Andong-Li-speech/Neural-Vocoders-as-Speech-Enhancers
Language:Python314
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
Language:Jupyter Notebook70470
audiolabs/torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
Language:Python14214
naba89/istft-torch
Quick and naive translation of torch.istft() to python with option to skip NOLA check.
Language:Python1
YoonhyungLee94/SSFCVAE
Official PyTorch implementation of the paper "Boosting Speech Enhancement with Clean Self-Supervised Features via Conditional Variational Autoencoders"
Language:Python7
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language:Python2.4k221
vdumoulin/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
Language:TeX14k2.3k
starrytong/SCNet
Language:Python432
Emrys365/DNS_text
Transcripts of the DNS Challenge test sets
6
merlresearch/tf-locoformer
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Language:Python344
fgnt/pb_bss
Collection of EM algorithms for blind source separation of audio signals
Language:Python26560
ashutosh620/DDAEC
Language:Python4019
aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Language:Python894153
WenzheLiu-Speech/awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
1k221
csukuangfj/kaldi_native_io
python wrapper for kaldi's native I/O
Language:C++273
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Language:Python29344
akaashdash/xlstm
Language:Python343
MrYxJ/calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
Language:Python51216
sovrasov/flops-counter.pytorch
Flops counter for convolutional networks in pytorch framework
Language:Python2.8k308
RoyChao19477/PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
Language:MATLAB537
RoyChao19477/SEMamba
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
Language:Python12312
ioyy900205/MFNet
This repo provides the processed samples of the manuscript "a Mask Free Neural Network for Monaural Speech Enhancement", which was accepted by INTERSPEECH2023.
364
key2miao/TSTNN
transformer based neural network for speech enhancement in time domain
Language:Python6613
dmlguq456/SepReformer
Official repository of SepReformer for speech separation
Language:Python847
tomasJwYU/AutoPrepDemo
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
Language:SCSS283
nickcercone/spectrogram
Language:Jupyter Notebook5
haoxiangsnr/SpEx
Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".
Language:Python339
mborsdorf/UniversalSpeakerExtraction
Language:Python144