Pinned Repositories
ErikZ719's Repositories
ErikZ719/awesome-multimodal-in-medical-imaging
A collection of resources on applications of multi-modal learning in medical imaging.
ErikZ719/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
ErikZ719/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
ErikZ719/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
ErikZ719/handwritten_text
ErikZ719/learning_research
本人的科研经验
ErikZ719/PAI
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
ErikZ719/STDM