Pinned Repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
OneChart
[ACM MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
Aircraft-KP
Keypoint dataset for airplanes
GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
HumanLiker
[NeurIPS 2022 Spotlight] HumanLiker: A Human-like Object Detector to Model the Manual Labeling Process
Slow-Perception
Official code implementation of Slow Perception: Let's Perceive Geometric Figures Step-by-step
Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Vary-family
Vary-tiny-600k
Vary-tiny codebase built on LAVIS (for training from scratch) and a dataset of PDF image-text pairs (about 600k, English and Chinese)
Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Ucas-HaoranWei's Repositories
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
Ucas-HaoranWei/Slow-Perception
Official code implementation of Slow Perception: Let's Perceive Geometric Figures Step-by-step
Ucas-HaoranWei/Vary-tiny-600k
Vary-tiny codebase built on LAVIS (for training from scratch) and a dataset of PDF image-text pairs (about 600k, English and Chinese)
Ucas-HaoranWei/Vary-family
Ucas-HaoranWei/Aircraft-KP
Keypoint dataset for airplanes
Ucas-HaoranWei/HumanLiker
[NeurIPS 2022 Spotlight] HumanLiker: A Human-like Object Detector to Model the Manual Labeling Process
Ucas-HaoranWei/CornerAffinity
[IJCAI 2022] Corner Affinity: A Robust Grouping Algorithm to Make Corner-guided Detector Great Again
Ucas-HaoranWei/Fox
Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"
Ucas-HaoranWei/Ucas-HaoranWei