StyleNet: Generating Attractive Visual Captions with Styles

* under development

StyleNet is a framework for generating attractive captions for images and videos in different styles. It uses a novel model component, the factored LSTM, which automatically distills style factors from a monolingual text corpus.
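The factoring idea can be illustrated with a small NumPy sketch. It assumes that an LSTM input weight matrix is written as a product of two factors shared by all styles and one small style-specific factor, so that switching styles only swaps the middle factor. The dimensions, the style names, and the `input_projection` helper are hypothetical. The sketch covers a single input projection rather than a full LSTM cell, and it is not the code in this repository.

```python
import numpy as np

rng = np.random.RandomState(0)

# Hypothetical sizes, chosen only for illustration.
embed_dim, hidden_dim, factor_dim = 8, 16, 4

# Factors shared across every style.
U = rng.randn(hidden_dim, factor_dim)
V = rng.randn(factor_dim, embed_dim)

# One small style-specific factor per style (style names are illustrative).
S = {
    "factual": rng.randn(factor_dim, factor_dim),
    "humorous": rng.randn(factor_dim, factor_dim),
}


def input_projection(x, style):
    """Apply the factored weight U @ S[style] @ V to a word embedding x."""
    return U @ (S[style] @ (V @ x))


x = rng.randn(embed_dim)
h_factual = input_projection(x, "factual")
h_humorous = input_projection(x, "humorous")
```

Under this assumption, training on a monolingual styled corpus would only need to update the `S[style]` factor, while `U` and `V` stay shared with the factual captioning model.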

Figure: framework

Figure: examples of generated captions

Description

Requirements

  • python 3.5.3
  • pytorch 0.2.0
  • torchvision 0.1.9
  • numpy 1.13.3
  • scikit-image 0.13.1
  • nltk 3.2.5