Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering
Primary LanguagePython