Pinned Repositories
Missing-AVQA
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
MonoWAD
Official Repository for "MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection" (ECCV 2024)
NoPrior_MultiSSL
Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)
SIRA-SSL
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
VisualAIKHU's Repositories
VisualAIKHU/MonoWAD
Official Repository for "MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection" (ECCV 2024)
VisualAIKHU/SAMPD
Official Repository for "Multispectral Pedestrian Detection with Sparsely Annotated Label" (AAAI 2025)
VisualAIKHU/SIRA-SSL
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
VisualAIKHU/Keyword-DETR
Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI 2025)
VisualAIKHU/Missing-AVQA
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
VisualAIKHU/NoPrior_MultiSSL
Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)