Multimodal sentence summarization, aiming to generate a brief summary of the source sentence and image
Primary LanguagePython