VisualAIKHU

Pinned Repositories

Missing-AVQA
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
Language: Python 11 0 20
MonoWAD
Official Repository for "MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection" (ECCV 2024)
Language: Python 37 2 73
NoPrior_MultiSSL
Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)
Language: Python 11 0 01
SIRA-SSL
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
Language: Python 14 0 23

VisualAIKHU's Repositories

VisualAIKHU/MonoWAD
Official Repository for "MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection" (ECCV 2024)
Language: Python 37 2 73
VisualAIKHU/SAMPD
Official Repository for "Multispectral Pedestrian Detection with Sparsely Annotated Label" (AAAI 2025)
Language: Python 25 0 00
VisualAIKHU/SIRA-SSL
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
Language: Python 14 0 23
VisualAIKHU/Keyword-DETR
Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI 2025)
Language: Python 11 1 2
VisualAIKHU/Missing-AVQA
Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024)
Language: Python 11 0 20
VisualAIKHU/NoPrior_MultiSSL
Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)
Language: Python 11 0 01