Pinned Repositories
SQUAT
The official code for "Devil's on the Edges: Selective Quad Attention for Scene Graph Generation", CVPR 2023.
OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
Fox
Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
foxhome
GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
hello-world
This is my first try on GitHub.
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model approaching GPT-4V performance.
Vary
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
X-LineNetv2
ucaslcl's Repositories
ucaslcl/Fox
Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
ucaslcl/foxhome
ucaslcl/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
ucaslcl/hello-world
This is my first try on GitHub.
ucaslcl/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model approaching GPT-4V performance.
ucaslcl/Vary
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
ucaslcl/X-LineNetv2