ZongChen's Stars
cmhungsteve/Awesome-Transformer-Attention
A comprehensive paper list for Vision Transformer/Attention, including papers, code, and related websites
PKU-ICST-MIPL/FineFMPL_IJCAI2024
wxz0530/PDEM
muzairkhattak/PromptSRC
[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".
hustvl/PersonViT
PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification
qiuxiaoyu9954/ProMPT
nengdong96/CSDN
AsuradaYuci/TF-CLIP
TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)
cvlab-kaist/Diff-ID
RikoLi/PCL-CLIP
Code for "Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification".
workingcoder/MCJA
A New Strong and Simple Baseline Method for VI-ReID (Bridging the Gap: Multi-level Cross-modality Joint Alignment for Visible-infrared Person Re-identification)
WLuLi/CapS-Adapter
[ACMMM 2024] CapS-Adapter: Caption-based MultiModal Adapter in Zero-Shot Classification
ArchipLab-LinfengZhang/contrastive-deep-supervision
Code for the ECCV 2022 paper "Contrastive Deep Supervision"
YBZh/DMN
CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
SwinTransformer/Feature-Distillation
FereshteShakeri/FewShot-CLIP-Strong-Baseline
muzairkhattak/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
zhengli97/PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
KyanChen/RSMamba
This is the PyTorch implementation of the paper "RSMamba: Remote Sensing Image Classification with State Space Model"
CzAngus/CCLNet
Vill-Lab/2024-AAAI-HPT
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
FudanVI/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
baidu/DuReader
Baseline Systems of DuReader Dataset
jaimin001/question-answering-document-images
B.Tech Project Repository
Syliz517/CLIP-ReID
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Imbalance-VLM/Imbalance-VLM
yuan687198/TKG-Net