Pinned Repositories
adapres-CLIP
Chatrio-backend
Chatrio_frontend
DK5BreakingCrew
Tsinghua DK5
Laion_clean
LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
VisCPM
MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on CPM foundation models
GUICourse
GUICourse: From General Vision Language Models to Versatile GUI Agents
Cuiunbo's Repositories
Cuiunbo/Chatrio_frontend
Cuiunbo/LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Cuiunbo/adapres-CLIP
Cuiunbo/Chatrio-backend
Cuiunbo/DK5BreakingCrew
Tsinghua DK5
Cuiunbo/Laion_clean
Cuiunbo/VisCPM
Cuiunbo/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 30+ HF models, and 15+ benchmarks