YoshikiMas

Pinned Repositories

asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python1 0 00
Code-for-Griffin-Lim-like-phase-recovery-via-ADMM
Language:Jupyter Notebook6 0 04
madeon-asr
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
Language:Python9 2 00
plt-docker
Language:Dockerfile1 1 02
pr4sss_python
Phase reconstruction for sound source separation
Language:Jupyter Notebook1 2 70
pytorch-ltfatpy
Language:Dockerfile1 1 00
s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
Language:Python1 0 00
signal-reconstruction-from-mel-spectrogram
Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."
Language:HTML1 1 00
speech-command-recognition-with-pytorch-lightning
Rewrote the torchaudio tutorial using PyTorch Lightning.
Language:Jupyter Notebook2 1 00
speech-enhancement-with-pytorch-lightning
Language:Python5 2 12

YoshikiMas's Repositories

YoshikiMas/madeon-asr
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
Language:Python9 2 00
YoshikiMas/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python1 0 00
YoshikiMas/s3prl
Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)
Language:Python1 0 00
YoshikiMas/signal-reconstruction-from-mel-spectrogram
Audio demos for "Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase."
Language:HTML1 1 00
YoshikiMas/SPMamba
Language:Python1 0 0
YoshikiMas/asteroid-docker
Docker for Speech Separation and Enhancement by Using Asteroid
Language:Dockerfile0 1 00
YoshikiMas/AmplitudeMatching
A multizone sound field control method to synthesize a desired amplitude (or magnitude) distribution over a target region with multiple loudspeakers
Language:Jupyter Notebook0 0
YoshikiMas/asteroid_jaCappella
Language:Python0 0
YoshikiMas/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language:Python0 0
YoshikiMas/BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
Language:Python0 0
YoshikiMas/BSRNN
Language:Python0 0
YoshikiMas/clarity
Clarity Challenges
Language:Python0 0
YoshikiMas/dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
Language:Python0 0
YoshikiMas/demo-page-example
An example for audio demo page
Language:HTML1 0
YoshikiMas/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python0 0
YoshikiMas/espnet
End-to-End Speech Processing Toolkit
Language:Python0 0
YoshikiMas/hartufo
A Python toolkit for data-driven HRTF research
Language:Python0 0
YoshikiMas/hearinganythinganywhere
Hearing Anything Anywhere Code Release
Language:Jupyter Notebook0 0
YoshikiMas/HRTF-upsampling-with-a-generative-adversarial-network-using-a-gnomonic-equiangular-projection
Language:Python0 0
YoshikiMas/LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
Language:Jupyter Notebook0 0
YoshikiMas/MeshRIR
MeshRIR: Dataset of room impulse responses on meshed grid points
Language:Jupyter Notebook0 0
YoshikiMas/mimo-iris
Demo page for the integration of speech separation and recognition with self-supervised learning representation
Language:HTML2 0
YoshikiMas/nlg-eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Language:Python0 0
YoshikiMas/pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
Language:Python0 0
YoshikiMas/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Language:Python0 0
YoshikiMas/RIRPINN
Room Impulse Response reconstruction with Physics Informed Neural Networks
Language:Jupyter Notebook0 0
YoshikiMas/signal-reconstruction-from-mel-spectrogram-via-admm
Audio demos for "Mel-Spectrogram Inversion via Alternating Direction Method of Multipliers."
Language:HTML1 0
YoshikiMas/Spatial-Audio-Metrics
Spatial Audio Metrics (SAM) is a toolbox to analyse spatial audio and spatial audio perceptual experiments
Language:Python0 0
YoshikiMas/spear-tools
SPEAR Challenge scripts and tools.
Language:Python0 0
YoshikiMas/spear-tools-waspaa2023
Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays
Language:Python0 0