mingfei-gao

Mingfei Gao

mingfei-gao's Stars

openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook27k 326 4063.4k
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Language:Python11k 45 4221.6k
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language:Python1.6k 13 141200
yuewang-cuhk/awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
1.1k 52 9105
hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
Language:Jupyter Notebook816 8 36107
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
Language:Python787 18 95109
jokieleung/awesome-visual-question-answering
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Commonsense Reasoning and related area.
658 26 395
robustness-gym/robustness-gym
Robustness Gym is an evaluation toolkit for machine learning.
Language:Python442 17 1638
xinke-wang/Awesome-Text-VQA
190 10 311
xumingze0308/TRN.pytorch
[ICCV 2019] Official implementation of Temporal Recurrent Networks for Online Action Detection
Language:Python85 4 2116
salesforce/PB-OVD
A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
Language:Python58 5 66
LoyoYang/DeCoTa
ICCV 2021: Deep Co-Training with Task Decomposition for Semi-supervised Domain Adaptation
Language:Python16 2 26
salesforce/QVR-SimpleDLM
Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.
Language:Python16 4 13
salesforce/burn-after-reading
Language:Python13 4 04
salesforce/woad-pytorch
This is the pytorch implementation of WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos (CVPR2021).
Language:Python12 6 26
salesforce/inv-cdip
INV-CDIP Dataset
Language:Python9 4 13
salesforce/fieldExtractor
Language:Python5 4 02

mingfei-gao

mingfei-gao's Stars

openai/CLIP

jacobgil/pytorch-grad-cam

salesforce/ALBEF

yuewang-cuhk/awesome-vision-language-pretraining-papers

hila-chefer/Transformer-MM-Explainability

ChenRocks/UNITER

jokieleung/awesome-visual-question-answering

robustness-gym/robustness-gym

xinke-wang/Awesome-Text-VQA

xumingze0308/TRN.pytorch

salesforce/PB-OVD

LoyoYang/DeCoTa

salesforce/QVR-SimpleDLM

salesforce/burn-after-reading

salesforce/woad-pytorch

salesforce/inv-cdip

salesforce/fieldExtractor