gwkrsrch
Applied Research Scientist at NAVER Cloud AI | Ph.D. student at KAIST AI | Previously at Kyoto University | Homepage: https://geewook.kim/
NAVER Cloud AI & KAIST AI
Pinned Repositories
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
webvicob
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
cream
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023
elva
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, EMNLP 2024
prometheus-vision
[ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on customized score rubric, Prometheus-Vision is a good alternative for human evaluation and GPT-4V evaluation.
gwkrsrch's Repositories
gwkrsrch doesn’t have any repository yet.