ZongChen

ZongChen's Stars

cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.6k488
PKU-ICST-MIPL/FineFMPL_IJCAI2024
Language:Python21
wxz0530/PDEM
1
muzairkhattak/PromptSRC
[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".
Language:Python2208
hustvl/PersonViT
PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification
Language:Python101
qiuxiaoyu9954/ProMPT
2
nengdong96/CSDN
Language:Python3
AsuradaYuci/TF-CLIP
TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024)
Language:Python394
cvlab-kaist/Diff-ID
221
RikoLi/PCL-CLIP
Code for "Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification".
Language:Python151
workingcoder/MCJA
A New Strong and Simple Baseline Method for VI-ReID (Bridging the Gap: Multi-level Cross-modality Joint Alignment for Visible-infrared Person Re-identification)
Language:Python30
WLuLi/CapS-Adapter
[ACMMM 2024] CapS-Adapter: Caption-based MultiModal Adapter in Zero-Shot Classification
Language:Python6
ArchipLab-LinfengZhang/contrastive-deep-supervision
Codes for ECCV2022 paper - contrastive deep supervision
Language:Python684
YBZh/DMN
CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Language:Python544
SwinTransformer/Feature-Distillation
Language:Python23611
FereshteShakeri/FewShot-CLIP-Strong-Baseline
Language:Python211
muzairkhattak/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
Language:Python63346
zhengli97/PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
Language:Python1923
KyanChen/RSMamba
This is the pytorch implement of the paper "RSMamba: Remote Sensing Image Classification with State Space Model"
Language:Python22115
CzAngus/CCLNet
Language:Python143
Vill-Lab/2024-AAAI-HPT
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
Language:Python624
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Language:Python1.7k191
FudanVI/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
Language:Python42652
baidu/DuReader
Baseline Systems of DuReader Dataset
Language:Python1.1k308
jaimin001/question-answering-document-images
B.Tech Project Repository
Language:Jupyter Notebook21
Syliz517/CLIP-ReID
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)
Language:Python25741
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook9.1k1.4k
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language:Python5.7k466
Imbalance-VLM/Imbalance-VLM
Language:Python1158
yuan687198/TKG-Net
Language:Python3