KeisukeImoto's Stars
KeisukeImoto/RWCPSSD_Onomatopoeia
RWCP-SSD-Onomatopoeia
DCASE2023-Task7-Foley-Sound-Synthesis/dcase2023_task7_baseline
sarulab-speech/visual-onoma-to-wave
Visual onoma-to-wave official implementation
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
KeisukeImoto/mtl_sed_asc
Joint analysis of sound events and acoustic scenes based on multitask learning
qiuqiangkong/audioset_tagging_cnn
CrowdCurio/audio-annotator
A JavaScript interface for annotating and labeling audio files.
an-tran528/wavetransformer
Code base for WaveTransformer: A novel architecture for automated audio captioning
audio-captioning/audio-captioning-papers
A list of papers about audio captioning
audio-captioning/dcase-2020-baseline
Audio captioning baseline system for DCASE 2020 challenge.
toni-heittola/js-datatable
jQuery plugin to generate dynamic HTML tables with data visualization: https://toni-heittola.github.io/js-datatable/
karolpiczak/ESC-50
ESC-50: Dataset for Environmental Sound Classification
DCASE-REPO/dcase_util
A collection of utilities for Detection and Classification of Acoustic Scenes and Events
espnet/espnet
End-to-End Speech Processing Toolkit