Reagan1311's Stars
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
UxxHans/Rainbow-Cats-Personal-WeChat-MiniProgram
给女朋友做的微信小程序!情侣自己的任务和商城系统!
hila-chefer/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
facebookresearch/co3d
Tooling for the Common Objects In 3D dataset.
jacobgil/vit-explain
Explainability for Vision Transformers
hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
isl-org/lang-seg
Language-Driven Semantic Segmentation
MarkMoHR/Awesome-Referring-Image-Segmentation
:books: A collection of papers about Referring Image Segmentation.
SeanChenxy/Hand3DResearch
ShirAmir/dino-vit-features
Official implementation for the paper "Deep ViT Features as Dense Visual Descriptors".
tfzhou/ProtoSeg
CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View
ddshan/hand_object_detector
Project and dataset webpage:
showlab/EgoVLP
[NeurIPS2022] Egocentric Video-Language Pretraining
ir413/mvp
Masked Visual Pre-training for Robotics
xiaojianzhong/awesome-weakly-supervised-semantic-segmentation
A comprehensive list of weakly supervised semantic segmentation (WSSS) works from 2014 to 2022.
vasgaowei/TS-CAM
Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.
usr922/wseg
[CVPR'22] Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast
yzqin/dexmv-sim
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos, ECCV 2022
oakink/OakInk
[CVPR 2022] OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction
owenzlz/EgoHOS
Fine-Grained Egocentric Hand-Object Segmentation, ECCV 2022
maeve07/RCA
zhiweichen0012/Weakly-Supervised-Object-Localization-Paper-List
Weakly Supervised Object Localization Paper List
talshaharabany/what-is-where-by-looking
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
showlab/Q2A
[ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
mrudorfer/burg-toolkit
Toolkit for benchmarking and understanding robotic grasping
Jazzcharles/CREAM
Weakly Supervised Object Localization via Class RE-Activation Mapping