Pinned Repositories
GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
PageNet
Official implementation of PageNet (IJCV 2022)
SCUT-HEAD-Dataset-Release
SCUT HEAD is a large-scale head detection dataset, including 4405 images labeled with 111251 heads.
SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
UPOCR
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
ViTEraser
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 2024)
shannanyinxiang's Repositories
shannanyinxiang/SPTS
Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)
shannanyinxiang/PageNet
Official implementation of PageNet (IJCV 2022)
shannanyinxiang/ViTEraser
Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 2024)
shannanyinxiang/UPOCR
Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)
shannanyinxiang/SCUT-HEAD-Dataset-Release
SCUT HEAD is a large-scale head detection dataset, including 4405 images labeled with 111251 heads.