Pinned Repositories
arXiv_recbot
A Telegram bot to recommend arXiv papers
ChatGPT
run demo
iFormer
Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]
L2ViT
Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer
SJTUgaze
A Multiview Dataset for Gaze Estimation
VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
VLMEvalKit
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
RepViT
RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything
ChuanyangZheng's Repositories
ChuanyangZheng/iFormer
Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]
ChuanyangZheng/L2ViT
Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer
ChuanyangZheng/SJTUgaze
A Multiview Dataset for Gaze Estimation
ChuanyangZheng/arXiv_recbot
A Telegram bot to recommend arXiv papers
ChuanyangZheng/ChatGPT
run demo
ChuanyangZheng/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks