niluthpol/multimodal_vtt

Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval

Python

Issues

Features extraction preprocessing.
#22 opened 3 years ago
1
pretrained_model
#21 opened 4 years ago
2
vocab.pkl of MSVD
#20 opened 4 years ago
1
Video-caption pairs' number of `msvd_captions_test.pkl` in `all` dir is strange
#19 opened 4 years ago
1
the features about MSVD
#18 opened 4 years ago
6
questions about MSVD
#17 opened 4 years ago
2
Clarification
#16 opened 4 years ago
1
missing videos
#15 opened 4 years ago
1
Error whiel training
#14 opened 4 years ago
1
Error while training
#13 opened 4 years ago
1
.pkl file of msvd dataset
#12 opened 4 years ago
1
MSVD dataset
#11 opened 5 years ago
6
some problems about the expermental result
#10 opened 4 years ago
1
some questions
#9 opened 5 years ago
1
some questions about this line
#8 opened 5 years ago
1
the baseline of VSEPP-ResNet on MSVD
#7 opened 5 years ago
0
where is the self.img_enc.train() and self.img_enc.eval()
#6 opened 5 years ago
0
many more feature files missing on Google Drive
#5 opened 5 years ago
1
train/test data missing on Google Drive
#4 opened 5 years ago
2
AttributeError: 'module' object has no attribute 'Vocabulary'
#3 opened 6 years ago
1
ask for vocabulary pickle files
#2 opened 6 years ago
0