kaen2891

Applied Scientist Intern at Amazon | PhD student at Kyungpook National University | Ex-Naver Intern

Kyungpook National University, South Korea

Pinned Repositories

adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance"
Language: Python 16 2 02
bts
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"
Language: Python 8 3 00
epd_for_vad
Find the end point using a statistical voice activity detection model
Language: Python 1 1 00
improved_spoken_language_representation
Official code of Improved Spoken Language Representation for Intent Understanding in a Task-Oriented Dialogue System
Language: Python 1 1 00
military_audio_dataset
Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance"
Language: Python 2 1 02
Multi_DTW
Multi_DTW_with_Spectrogram
Language: Python 2 1 02
s3prl2
modifying s3prl
Language: Python 3 1 00
stethoscope-guided_supervised_contrastive_learning
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification"
Language: Python 11 2 30
voice_conversion_ijcnn2020
Research Results
Language: Python 3 2 00
patch-mix_contrastive_learning
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)
Language: Python 60 4 811

kaen2891's Repositories

kaen2891/adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance"
Language: Python 16 2 02
kaen2891/stethoscope-guided_supervised_contrastive_learning
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification"
Language: Python 11 2 30
kaen2891/bts
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"
Language: Python 8 3 00
kaen2891/s3prl2
modifying s3prl
Language: Python 3 1 00
kaen2891/military_audio_dataset
Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance"
Language: Python 2 1 02
kaen2891/epd_for_vad
Find the end point using a statistical voice activity detection model
Language: Python 1 1 00
kaen2891/improved_spoken_language_representation
Official code of Improved Spoken Language Representation for Intent Understanding in a Task-Oriented Dialogue System
Language: Python 1 1 00
kaen2891/spec_augment
🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Language: Jupyter Notebook 1 0 00
kaen2891/a1003
Language: Jupyter Notebook 0 0 00
kaen2891/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
Language: HTML 0 0 00
kaen2891/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Language: Jupyter Notebook 0 0
kaen2891/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language: Python 0 0
kaen2891/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language: Python 0 0
kaen2891/etri_multimodal
Language: Python 0 0
kaen2891/FilterAugSED
Language: Python 0 0
kaen2891/GenSCL
Official PyTorch implementation of "Generalized Supervised Contrastive Learning Framework"
Language: Python 0 0
kaen2891/kenlm
KenLM: Faster and Smaller Language Model Queries
Language: C++ 0 0
kaen2891/korean_speech_data_preprocessing
Preprocessing for the AIHub Korean speech dataset
Language: Python 1 0
kaen2891/Llama-2
All the projects related to Llama
Language: Jupyter Notebook 0 0
kaen2891/lva
LG AI Intermediate Courses (Computer Vision)
Language: Jupyter Notebook 0 0
kaen2891/misp2022_baseline
Language: Shell 0 0
kaen2891/nn_basic_2021
Code for a basic neural network course
Language: Jupyter Notebook 1 0
kaen2891/profile
Language: HTML 1 0
kaen2891/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
0 0
kaen2891/s3prl_modified
s3prl modification
Language: Python 1 0
kaen2891/self-supervised-speech-recognition
Speech-to-text with self-supervised learning based on the wav2vec 2.0 framework
Language: Python 0 0
kaen2891/SMART-G2P
Language: Python 0 0
kaen2891/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Language: Python 0 0
kaen2891/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
Language: Python 0 0
kaen2891/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Language: Python 0 0