tf-image-caption

This is an implementation of the paper Show and tell: A neural image caption generator (CVPR2015) with tensorflow. I trained the model under two dataset.

net flow

preprocess_image_mscoco.py
preprocess_caption_mscoco.py

train_NIC_mscoco.py
test_NIC_mscoco.py

restore from model-100

captions	content
gt_1	a big airplane flying in the big blue sky
gt_2	large , two decked , four airliner in flight
gt_3	an airfrance jet airplane flying in the sky
gt_4	a big plane with airfrance on the side of it
gt_5	an air france air plane in mid flight
predicted	a large commercial airplane flying in the sky

captions	content
predicted	a group of people walking down a street

preprocess_image_ai.py
preprocess_caption_ai.py

train_NIC_ai.py
test_NIC_ai.py

20 epoches

captions	content
gt_1	一个穿着裙子的女孩双手拿着东西站在宽阔的草地上
gt_2	宽阔的草地上站着一个双手拿着果子的孩子
gt_3	草地上一个披着长发的女孩在亲水果
gt_4	茂盛的草地上有一个穿着白色的连衣裙的女孩在亲吻水果
gt_5	绿油油的草地上站着一个双手拿着水果的女孩
predicted	一个双手拿着花的女人站在茂盛的草丛里