Pinned Repositories
awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
CSOL_TPAMI2021
The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.
LLM_articulated_object_manipulation
MMCosine_ICASSP23
The code repo for the ICASSP 2023 paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
MMPareto_ICML2024
The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
MUSIC-AVQA
MUSIC-AVQA, CVPR 2022 (ORAL)
MWAFM
Multi-Scale Attention for Audio Question Answering
OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
Ref-AVS
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
GeWu-Lab's Repositories
GeWu-Lab/awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
GeWu-Lab/MUSIC-AVQA
MUSIC-AVQA, CVPR 2022 (ORAL)
GeWu-Lab/Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
GeWu-Lab/MWAFM
Multi-Scale Attention for Audio Question Answering
GeWu-Lab/LLM_articulated_object_manipulation
GeWu-Lab/MMPareto_ICML2024
The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
GeWu-Lab/MMCosine_ICASSP23
The code repo for the ICASSP 2023 paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
GeWu-Lab/PSTP-Net
GeWu-Lab/Ref-AVS
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
GeWu-Lab/Certifiable-Robust-Multi-modal-Training
A Python implementation of Certifiable Robust Multi-modal Training
GeWu-Lab/Generalizable-Audio-Visual-Segmentation
Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024
GeWu-Lab/TSPM
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
GeWu-Lab/Diagnosing_Relearning_ECCV2024
The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024
GeWu-Lab/LFAV
Towards Long Form Audio-visual Video Understanding
GeWu-Lab/Stepping-Stones
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
GeWu-Lab/awesome-balanced-multimodal-learning
A curated list of balanced multimodal learning methods.
GeWu-Lab/cross-modal-distillation
GeWu-Lab/Revisiting-Pre-training-in-Audio-Visual-Learning
The repo for "Revisiting Pre-training in Audio-Visual Learning"
GeWu-Lab/Geometric-Inspired-Graph-based-Incomplete-Multi-view-Clustering
A Python implementation of Geometric-Inspired Graph-based Incomplete Multi-view Clustering
GeWu-Lab/bias_in_AVS
Official repository for "Unveiling and Mitigating Bias in Audio Visual Segmentation" in ACM MM 2024
GeWu-Lab/Sounding-Object-Segmentation-Preference
The official repo for "Can Textual Semantics Mitigate Sounding Object Segmentation Preference?", ECCV 2024
GeWu-Lab/DepthHelps-IROS2024
GeWu-Lab/.github
GeWu-Lab/audio-visual-learning
https://gewu-lab.github.io/audio-visual-learning/
GeWu-Lab/gewu-lab.github.io
GeWu-Lab/MMCosine
Project page for the ICASSP 2023 paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
GeWu-Lab/Balanced-Audiovisual-Dataset
GeWu-Lab/llm_for_articulated_object_manipulation
GeWu-Lab/stepping_stones