Pinned Repositories
APL
[2024 AAAI] Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AVSBench
[2022 ECCV] Audio-Visual Segmentation
awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
CPSP
[2023 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line
FAVDBench
[CVPR 2023] Official implementation of the paper: Fine-grained Audible Video Description
LEAP
[2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
OV-AVEL
[2024 Arxiv] Towards Open-Vocabulary Audio-Visual Event Localization
PSP_CVPR_2021
[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
VPLAN
[2024 IJCV] The official implementation of our paper "Improving Audio-Visual Video Parsing with Pseudo Visual Labels"
AVSBench
[ECCV 2022] & [IJCV 2024] Official implementation of the paper: Audio-Visual Segmentation (with Semantics)
jasongief's Repositories
jasongief/PSP_CVPR_2021
[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
jasongief/CPSP
[2023 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line
jasongief/VPLAN
[2024 IJCV] The official implementation of our paper "Improving Audio-Visual Video Parsing with Pseudo Visual Labels"
jasongief/AVSBench
[2022 ECCV] Audio-Visual Segmentation
jasongief/APL
[2024 AAAI] Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
jasongief/awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
jasongief/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet features.