ictnlp/DSTC8-AVSD
We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog".
PythonMIT
Issues
- 2
- 2
Model Weights
#1 opened by vibhavagarwal5 - 5
- 2
- 15
Question on reproduction
#2 opened by cshanjiewu