Pinned Repositories
ControlCap
[ECCV 2024] ControlCap: Controllable Region-level Captioning
DynRefer
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
FANet
[ICME 2023] Explore Faster Localization Learning For Scene Text Detection
FlowText
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
GenPromp
[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization
Kosmos25Vqa_eval
Repo1127
TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
zhaoyuzhong.github.io-main
Resume
callsys's Repositories
callsys/ControlCap
[ECCV 2024] ControlCap: Controllable Region-level Captioning
callsys/GenPromp
[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization
callsys/DynRefer
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
callsys/TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
callsys/FlowText
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
callsys/FANet
[ICME 2023] Explore Faster Localization Learning For Scene Text Detection
callsys/zhaoyuzhong.github.io-main
Resume
callsys/Kosmos25Vqa_eval
callsys/Repo1127