Pinned Repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers
Code-for-Griffin-Lim-like-phase-recovery-via-ADMM
plt-docker
pr4sss_python
Phase reconstruction for sound source separation
pytorch-ltfatpy
s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
signal-reconstruction-from-mel-spectrogram
Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."
SoundSourceSeparation
The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.
speech-command-recognition-with-pytorch-lightning
A rewrite of the torchaudio tutorial using PyTorch Lightning.
speech-enhancement-with-pytorch-lightning
YoshikiMas's Repositories
YoshikiMas/asteroid
The PyTorch-based audio source separation toolkit for researchers
YoshikiMas/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
YoshikiMas/signal-reconstruction-from-mel-spectrogram
Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."
YoshikiMas/SPMamba
YoshikiMas/asteroid-docker
Docker for Speech Separation and Enhancement by Using Asteroid
YoshikiMas/AmplitudeMatching
A multizone sound field control method to synthesize a desired amplitude (or magnitude) distribution over a target region with multiple loudspeakers
YoshikiMas/asteroid_jaCappella
YoshikiMas/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
YoshikiMas/BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
YoshikiMas/BSRNN
YoshikiMas/clarity
Clarity Challenges
YoshikiMas/dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
YoshikiMas/demo-page-example
An example for audio demo page
YoshikiMas/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
YoshikiMas/espnet
End-to-End Speech Processing Toolkit
YoshikiMas/hartufo
A Python toolkit for data-driven HRTF research
YoshikiMas/HRTF-upsampling-with-a-generative-adversarial-network-using-a-gnomonic-equiangular-projection
YoshikiMas/LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
YoshikiMas/libri_css
Libri-CSS: dataset and evaluation pipeline
YoshikiMas/MeshRIR
MeshRIR: Dataset of room impulse responses on meshed grid points
YoshikiMas/mimo-iris
Demo page for the integration of speech separation and recognition with self-supervised learning representation
YoshikiMas/mvae-ss
YoshikiMas/nlg-eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
YoshikiMas/paderwasn
Paderwasn is a collection of methods for acoustic signal processing in wireless acoustic sensor networks (WASNs).
YoshikiMas/pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
YoshikiMas/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
YoshikiMas/Spatial-Audio-Metrics
Spatial Audio Metrics (SAM) is a toolbox to analyse spatial audio and spatial audio perceptual experiments
YoshikiMas/spear-tools
SPEAR Challenge scripts and tools.
YoshikiMas/spear-tools-waspaa2023
Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays
YoshikiMas/whisper-asr-finetune