GeWu-Lab

Pinned Repositories

awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
218 9 216
CSOL_TPAMI2021
The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.
Language:Python29 1 13
LLM_articulated_object_manipulation
Language:Python22 2 50
MMCosine_ICASSP23
The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
Language:Python17 3 41
MMPareto_ICML2024
The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
Language:Python21 3 10
MUSIC-AVQA
MUSIC-AVQA, CVPR2022 (ORAL)
Language:Python65 2 77
MWAFM
Multi-Scale Attention for Audio Question Answering
Language:Python24 2 41
OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
Language:Python218 4 4618
Ref-AVS
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
Language:Python140
Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
Language:Python31 2 01

GeWu-Lab's Repositories

GeWu-Lab/awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
218 9 216
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
Language:Python218 4 4618
GeWu-Lab/MUSIC-AVQA
MUSIC-AVQA, CVPR2022 (ORAL)
Language:Python65 2 77
GeWu-Lab/Valuate-and-Enhance-Multimodal-Cooperation
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
Language:Python31 2 01
GeWu-Lab/MWAFM
Multi-Scale Attention for Audio Question Answering
Language:Python24 2 41
GeWu-Lab/LLM_articulated_object_manipulation
Language:Python22 2 50
GeWu-Lab/MMPareto_ICML2024
The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
Language:Python21 3 10
GeWu-Lab/MMCosine_ICASSP23
The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
Language:Python17 3 41
GeWu-Lab/PSTP-Net
Language:Python16 1 31
GeWu-Lab/Ref-AVS
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
Language:Python140
GeWu-Lab/Certifiable-Robust-Multi-modal-Training
A python implement for Certifiable Robust Multi-modal Training
Language:Python13 1 20
GeWu-Lab/Generalizable-Audio-Visual-Segmentation
Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024
Language:Python13 1 43
GeWu-Lab/TSPM
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
Language:Python9
GeWu-Lab/Diagnosing_Relearning_ECCV2024
The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024
Language:Python8 2 20
GeWu-Lab/LFAV
Towards Long Form Audio-visual Video Understanding
Language:Python7 2 20
GeWu-Lab/Stepping-Stones
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
Language:Python71
GeWu-Lab/awesome-balanced-multimodal-learning
A curated list of balanced multimodal learning methods.
61
GeWu-Lab/cross-modal-distillation
Language:Python6 1 00
GeWu-Lab/Revisiting-Pre-training-in-Audio-Visual-Learning
The repo for "Revisiting Pre-training in Audio-Visual Learning"
Language:Python60
GeWu-Lab/Geometric-Inspired-Graph-based-Incomplete-Multi-view-Clustering
A python implement for Geometric-Inspired Graph-based Incomplete Multi-view Clustering
Language:Python41
GeWu-Lab/bias_in_AVS
Official repository for "Unveiling and Mitigating Bias in Audio Visual Segmentation" in ACM MM 2024
Language:Python3
GeWu-Lab/Sounding-Object-Segmentation-Preference
The official repo for "Can Textual Semantics Mitigate Sounding Object Segmentation Preference?", ECCV 2024
Language:Python3
GeWu-Lab/DepthHelps-IROS2024
Language:Python2
GeWu-Lab/.github
00
GeWu-Lab/audio-visual-learning
https://gewu-lab.github.io/audio-visual-learning/
Language:JavaScript0 1 00
GeWu-Lab/gewu-lab.github.io
Language:HTML0 1 12
GeWu-Lab/MMCosine
Project page for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
Language:JavaScript0 2 00
GeWu-Lab/Balanced-Audiovisual-Dataset
Language:JavaScript
GeWu-Lab/llm_for_articulated_object_manipulation
Language:JavaScript
GeWu-Lab/stepping_stones
Language:JavaScript