xiaokangpeng

xiaokangpeng's Stars

GeWu-Lab/MWAFM
Multi-Scale Attention for Audio Question Answering
Language:Python241
GeWu-Lab/awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
22017
NECOTIS/Sensory-substitution
Demonstration videos for the 2D and 3D mode of the new version of See Differently
1
GeWu-Lab/OGM-GE_CVPR2022
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
Language:Python22018
GeWu-Lab/CSOL_TPAMI2021
The repo for "Class-aware Sounding Objects Localization", TPAMI 2021.
Language:Python293
BizhuWu/Two-Stream-Network-PyTorch
Language:Python434
dropreg/R-Drop
Language:Python866107
Aman-4-Real/arXiv_Daily
A toolkit for arXiv papers daily reading. The script will crawl arXiv papers in custom areas everyday and display key information.
Language:Python8
jiangqy/DCMH-CVPR2017
source code for paper "Deep Cross-Modal Hashing"
Language:MATLAB9938
01joy/news-search-engine
新闻搜索引擎
Language:Python427128
The-AI-Summer/self-attention-cv
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
Language:Python1.2k156
jindongwang/transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
Language:Python13.3k3.8k
xiaokangpeng/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
1
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
5.9k842
pytorch/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language:Python22.2k9.5k
hche11/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
Language:Python28531
tata1661/FSL-Mate
FSL-Mate: A collection of resources for few-shot learning (FSL).
Language:Python1.7k290
krantiparida/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
65370
jakesnell/prototypical-networks
Code for the NeurIPS 2017 Paper "Prototypical Networks for Few-shot Learning"
Language:Python1.1k252