kaen2891
Applied Scientist Intern at Amazon | PhD student at Kyungpook National University | Ex-Naver Intern
Kyungpook National University, South Korea
Pinned Repositories
adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance"
bts
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"
epd_for_vad
Finds speech end points with a statistical voice activity detection model
improved_spoken_language_representation
Official code of Improved Spoken Language Representation for Intent Understanding in a Task-Oriented Dialogue System
military_audio_dataset
Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance"
Multi_DTW
Multi_DTW_with_Spectrogram
s3prl2
modifying s3prl
stethoscope-guided_supervised_contrastive_learning
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification"
voice_conversion_ijcnn2020
Research Results
patch-mix_contrastive_learning
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)
kaen2891's Repositories
kaen2891/adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance"
kaen2891/stethoscope-guided_supervised_contrastive_learning
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification"
kaen2891/bts
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"
kaen2891/s3prl2
modifying s3prl
kaen2891/military_audio_dataset
Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance"
kaen2891/epd_for_vad
Finds speech end points with a statistical voice activity detection model
kaen2891/improved_spoken_language_representation
Official code of Improved Spoken Language Representation for Intent Understanding in a Task-Oriented Dialogue System
kaen2891/spec_augment
🔦 A PyTorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
kaen2891/a1003
kaen2891/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
kaen2891/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
kaen2891/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
kaen2891/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
kaen2891/etri_multimodal
kaen2891/FilterAugSED
kaen2891/GenSCL
Official Pytorch implementation of "Generalized Supervised Contrastive Learning Framework"
kaen2891/kenlm
KenLM: Faster and Smaller Language Model Queries
kaen2891/korean_speech_data_preprocessing
Preprocessing for the AIHub Korean speech dataset
kaen2891/Llama-2
All the projects related to Llama
kaen2891/lva
LG AI Intermediate Courses (Computer Vision)
kaen2891/misp2022_baseline
kaen2891/nn_basic_2021
Code for a basic neural networks course
kaen2891/profile
kaen2891/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
kaen2891/s3prl_modified
s3prl modification
kaen2891/self-supervised-speech-recognition
Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework
kaen2891/SMART-G2P
kaen2891/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
kaen2891/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
kaen2891/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E