GX77

Bytedance, Beijing

Pinned Repositories

TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Language: Python | 167 3 218
Dual-Stream-Transformer-for-Generic-Event-Boundary-Captioning
Language: Jupyter Notebook | 5 1 30
LCVSL
Language: Python | 9 1 11
TextKG
Language: Jupyter Notebook | 9 1 21
CGSTVG
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
Language: Python | 37 2 83
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python | 19.5k 299 1.4k 2.5k