om-ai-lab/GroundVLP

GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)

Jupyter NotebookApache-2.0

Watchers