xiaokangpeng's Stars
GeWu-Lab/MWAFM
Multi-Scale Attention for Audio Question Answering
GeWu-Lab/awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
NECOTIS/Sensory-substitution
Demonstration videos for the 2D and 3D modes of the new version of See Differently
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
GeWu-Lab/CSOL_TPAMI2021
The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.
BizhuWu/Two-Stream-Network-PyTorch
dropreg/R-Drop
Aman-4-Real/arXiv_Daily
A toolkit for daily reading of arXiv papers. The script crawls arXiv papers in custom areas every day and displays key information.
jiangqy/DCMH-CVPR2017
Source code for the paper "Deep Cross-Modal Hashing"
01joy/news-search-engine
News search engine
The-AI-Summer/self-attention-cv
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
jindongwang/transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, tutorials.
xiaokangpeng/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
pytorch/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
hche11/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
tata1661/FSL-Mate
FSL-Mate: A collection of resources for few-shot learning (FSL).
krantiparida/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
jakesnell/prototypical-networks
Code for the NeurIPS 2017 Paper "Prototypical Networks for Few-shot Learning"