Pinned Repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers
Code-for-Griffin-Lim-like-phase-recovery-via-ADMM
madeon-asr
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
plt-docker
pr4sss_python
Phase reconstruction for sound source separation
pytorch-ltfatpy
s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
signal-reconstruction-from-mel-spectrogram
Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."
speech-command-recognition-with-pytorch-lightning
torchaudioのtutorialをpytorch lightningを使って書き直しました.
speech-enhancement-with-pytorch-lightning
YoshikiMas's Repositories
YoshikiMas/madeon-asr
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
YoshikiMas/asteroid
The PyTorch-based audio source separation toolkit for researchers
YoshikiMas/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
YoshikiMas/signal-reconstruction-from-mel-spectrogram
Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."
YoshikiMas/SPMamba
YoshikiMas/asteroid-docker
Docker for Speech Separation and Enhancement by Using Asteroid
YoshikiMas/AmplitudeMatching
A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple loudspeakers
YoshikiMas/asteroid_jaCappella
YoshikiMas/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
YoshikiMas/BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
YoshikiMas/BSRNN
YoshikiMas/clarity
Clarity Challenges
YoshikiMas/dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
YoshikiMas/demo-page-example
An example for audio demo page
YoshikiMas/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
YoshikiMas/espnet
End-to-End Speech Processing Toolkit
YoshikiMas/hartufo
A Python toolkit for data-driven HRTF research
YoshikiMas/hearinganythinganywhere
Hearing Anything Anywhere Code Release
YoshikiMas/HRTF-upsampling-with-a-generative-adversarial-network-using-a-gnomonic-equiangular-projection
YoshikiMas/LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
YoshikiMas/MeshRIR
MeshRIR: Dataset of room impulse responses on meshed grid points
YoshikiMas/mimo-iris
Demo page for the integration of speech separation and recognition with self-supervised learning representation
YoshikiMas/nlg-eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
YoshikiMas/pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
YoshikiMas/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
YoshikiMas/RIRPINN
Room Impulse Response reconstruction with Physics Informed Neural Networks
YoshikiMas/signal-reconstruction-from-mel-spectrogram-via-admm
Audio demos for "Mel-Spectrogram Inversion via Alternating Direction Method of Multipliers."
YoshikiMas/Spatial-Audio-Metrics
Spatial Audio Metrics (SAM) is a toolbox to analyse spatial audio and spatial audio perceptual experiments
YoshikiMas/spear-tools
SPEAR Challenge scripts and tools.
YoshikiMas/spear-tools-waspaa2023
Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays