Pinned Repositories
IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
GFNet
[NeurIPS 2021] Global Filter Networks for Image Classification
lightning
The most intuitive, flexible way for researchers, ML engineers, and data scientists to build models (with PyTorch), research workflows, and production pipelines, with an obsessive focus on flexibility and performance.
lxmert
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
Towards-Open-Federated-Learning-Platforms-Survey
Material for Model-Centric FML survey
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
HVFormer
Multimodal Relation Extraction via a Mixture of Hierarchical Visual Context Learners. WWW'24
diaoyudiaochan's Repositories
diaoyudiaochan/GFNet
[NeurIPS 2021] Global Filter Networks for Image Classification
diaoyudiaochan/lightning
The most intuitive, flexible way for researchers, ML engineers, and data scientists to build models (with PyTorch), research workflows, and production pipelines, with an obsessive focus on flexibility and performance.
diaoyudiaochan/lxmert
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
diaoyudiaochan/Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
diaoyudiaochan/Towards-Open-Federated-Learning-Platforms-Survey
Material for Model-Centric FML survey
diaoyudiaochan/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
diaoyudiaochan/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch