DSA-Model
Model based on Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, using soft attention. The MS-COCO model is based on the coldmanck implementation; the Flickr8k and Flickr30k models are based on the fuqichen implementation.
- CNN Layer Model: VGG16 (default)
- RNN Layer Model: LSTM (default)
- Datasets: MS-COCO, Flickr8k & Flickr30k
- Scoring: BLEU_1, BLEU_2, BLEU_3, BLEU_4, METEOR, ROUGE_L, CIDEr
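As a rough illustration of the soft attention step named above, here is a minimal sketch in plain Python. It is not taken from either implementation: the function names `softmax` and `soft_attention` are hypothetical, and the relevance scores are passed in directly, whereas in the paper they come from a small network over the CNN region features and the previous LSTM hidden state.

```python
import math


def softmax(scores):
    """Turn unnormalized scores into weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_attention(features, scores):
    """Hypothetical helper: weighted average of region features.

    features: list of L region vectors, each of length D (e.g. CNN outputs).
    scores:   list of L unnormalized relevance scores (assumed given here).
    Returns (weights, context), where context has length D.
    """
    weights = softmax(scores)
    dim = len(features[0])
    context = [
        sum(w * f[d] for w, f in zip(weights, features))
        for d in range(dim)
    ]
    return weights, context


# Two regions with equal scores get equal weight.
weights, context = soft_attention([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
print(weights)  # [0.5, 0.5]
print(context)  # [0.5, 0.5]
```

Because every region contributes with a weight between 0 and 1, the context vector is a smooth function of the scores, which is what lets soft attention be trained with ordinary backpropagation.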
Requirements
- DATA zip file
- See the README.md of each dataset's implementation
Installation
- See the README.md of each dataset's implementation