Pinned Repositories
Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
VimTS
VimTS: A Unified Video and Image Text Spotter
Eddy1993-GPU's Repositories
Eddy1993-GPU/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Eddy1993-GPU/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
Eddy1993-GPU/VimTS
VimTS: A Unified Video and Image Text Spotter