/Multimodal-Transformers

List of papers and resources for multimodal transformers

Multimodal-Transformers

List of papers and resources for multimodal transformers

  1. Multimodal Transformer for Unaligned Multimodal Language Sequences, ACL 2019, https://github.com/yaohungt/Multimodal-Transformer

  2. SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION, SLT 2020

  3. Multimodal Transformer Fusion for Continuous Emotion Recognition, ICASSP 2020

  4. Multimodal transformer models https://github.com/georgian-io/Multimodal-Toolkit

  5. Low Rank Fusion based Transformers for Multimodal Sequences, ACL 2020

  6. Modulated Fusion using Transformer for Linguistic-Acoustic EmotionRecognition, ACL 2020, https://github.com/jbdel/modulated_fusion_transformer

  7. Attending to Emotional Narratives, ACII 2019, https://github.com/frankaging/ACII2019-transformer

  8. VATT: Transformers for Multimodal Self-Supervised Learningfrom Raw Video, Audio and Text, https://arxiv.org/pdf/2104.11178.pdf

  9. Multimodal Cross-and Self-Attention Network for Speech Emotion Recognition, ICASSP 2021