Pinned Repositories
CaraJ7.github.io
CoMat
[Neurips 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
MMSearch
The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
mmsearch.github.io
VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
HoP
[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
MathVerse
[ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
CaraJ7's Repositories
CaraJ7/MMSearch
The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
CaraJ7/CoMat
[Neurips 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
CaraJ7/CaraJ7.github.io
CaraJ7/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
CaraJ7/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks