Pinned Repositories
TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Dual-Stream-Transformer-for-Generic-Event-Boundary-Captioning
LCVSL
TextKG
CGSTVG
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities