selenium
MMUTF: Multimodal Multimedia Event Argument Extraction with Unified Template Filling [EMNLP'24]
Multimodal event extraction via prompts and enhanced visual-textual semantic fusion