niluthpol/multimodal_vtt
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
Python
Issues
- 1
Features extraction preprocessing.
#22 opened - 2
pretrained_model
#21 opened - 1
vocab.pkl of MSVD
#20 opened - 1
- 6
the features about MSVD
#18 opened - 2
questions about MSVD
#17 opened - 1
Clarification
#16 opened - 1
missing videos
#15 opened - 1
Error whiel training
#14 opened - 1
Error while training
#13 opened - 1
.pkl file of msvd dataset
#12 opened - 6
MSVD dataset
#11 opened - 1
- 1
some questions
#9 opened - 1
some questions about this line
#8 opened - 0
the baseline of VSEPP-ResNet on MSVD
#7 opened - 0
- 1
- 2
- 1
- 0
ask for vocabulary pickle files
#2 opened