A curated list of “Temporally Language Grounding” and related area
- TALL: Temporal Activity Localization via Language Query - Gao et al,
ICCV 2017
. [code1] [code2] - Localizing Moments in Video with Natural Language - Hendricks et al,
ICCV 2017
. [code]
- Localizing Moments in Video with Temporal Language - Hendricks et al,
EMNLP 2018
. - MAC: Mining Activity Concepts for Language-based Temporal Localization - Ge et al,
WACV 2018
. [code1] [code2] - Temporally Grounding Natural Sentence in Video - Chen et al,
EMNLP 2018
. - Cross-modal Moment Localization in Videos - Liu et al,
ACM MM 2018
. [code] - Attentive Moment Retrieval in Videos - Liu et al,
SIGIR 2018
. [code] - Multi-modal Circulant Fusion for Video-to-Language and Backward - Wu et al,
IJCAI 2018
.
- Localizing Natural Language in Videos - Chen et al,
AAAI 2019
. - Semantic Proposal for Activity Localization in Videos via Sentence Query - Chen et al,
AAAI 2019
. - Multilevel Language and Vision Integration for Text-to-Clip Retrieval - Xu et al,
AAAI 2019
. [code] - To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression - Yuan et al,
AAAI 2019
. [code] - Read,Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos - He et al,
AAAI 2019
. [code] - Tripping through time Efficient Localization of Activities in Videos - He et al,
arXiv preprint 2019
.