
Search around your own videos, navigate through scenes from image and text queries.

Primary language: Python. License: MIT.

Video_Scene_Search_Image_Text (Show and Tell Model, CNN-RNN)


Video Scene Search Using Image and Text

  • Train on your own videos to make them searchable
  • Search videos and video scenes using an image query
  • Search videos and video scenes using a text query
  • Search across multiple videos at once
  • Access everything through a REST API

Dependencies

Pretrained Model

Model Information

  • Show and Tell Model
  • Encoder-Decoder Neural Network
  • Encoder - Inception v3 image recognition model pretrained on the ILSVRC-2012-CLS image classification dataset.
  • Decoder - LSTM network trained on captions represented with an embedding model
  • Caption generator - beam search over the decoder's word predictions
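
To illustrate the caption generation step listed above, here is a minimal sketch of beam search in plain Python. It is not this repository's code: the function `beam_search`, the `next_log_probs` callback and the `TOY_MODEL` table are hypothetical names made up for the example, and the toy table stands in for the LSTM decoder, which in the real model would score the next word given the image and the words so far.

```python
import math
from typing import Callable, Dict, List, Tuple

START, END = "<S>", "</S>"  # assumed start/end markers for this sketch

def beam_search(
    next_log_probs: Callable[[List[str]], Dict[str, float]],
    beam_size: int = 3,
    max_len: int = 20,
) -> List[Tuple[List[str], float]]:
    """Return captions sorted by total log-probability, best first."""
    beams: List[Tuple[List[str], float]] = [([START], 0.0)]
    finished: List[Tuple[List[str], float]] = []
    for _ in range(max_len):
        # Extend every partial caption by every possible next word.
        candidates = []
        for tokens, score in beams:
            for word, logp in next_log_probs(tokens).items():
                candidates.append((tokens + [word], score + logp))
        # Keep only the beam_size highest-scoring extensions.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:beam_size]:
            if tokens[-1] == END:
                finished.append((tokens, score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    if not finished:
        # No caption reached the end marker within max_len words.
        finished = beams
    finished.sort(key=lambda c: c[1], reverse=True)
    return finished

# Hypothetical stand-in for the decoder: next-word probabilities
# that depend only on the previous word.
TOY_MODEL = {
    "<S>": {"a": 0.6, "the": 0.4},
    "a": {"dog": 0.5, "cat": 0.5},
    "the": {"dog": 0.9, "cat": 0.1},
    "dog": {"</S>": 1.0},
    "cat": {"</S>": 1.0},
}

def toy_next_log_probs(tokens: List[str]) -> Dict[str, float]:
    return {w: math.log(p) for w, p in TOY_MODEL[tokens[-1]].items()}

if __name__ == "__main__":
    best_tokens, best_score = beam_search(toy_next_log_probs, beam_size=2)[0]
    print(" ".join(best_tokens[1:-1]), round(math.exp(best_score), 2))
```

With `beam_size=1` the search is greedy and commits to "a" as the first word, ending with a caption of probability 0.30. With `beam_size=2` it also keeps "the" alive and finds "the dog" at 0.36, which is the reason for using a beam rather than always taking the single most likely next word.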

Instructions

  • Download the pretrained model and extract it into the models folder
  • Start the REST API by running 'video_rest.py'

Training your Videos

Search from Videos
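
As a rough illustration of how a text query could be matched against captioned scenes, the sketch below ranks scenes by word overlap between the query and each scene's caption. Everything in it is an assumption made for the example: the `Scene` record, the `search_scenes` function, the Jaccard scoring and the sample captions are hypothetical and are not taken from this repository. An image query could be handled the same way by first captioning the image and then searching with that caption.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple

@dataclass
class Scene:
    """Hypothetical index entry: one captioned moment in a video."""
    video: str
    start_sec: float
    caption: str

def tokenize(text: str) -> Set[str]:
    return set(text.lower().split())

def search_scenes(
    query: str, scenes: List[Scene], top_k: int = 3
) -> List[Tuple[Scene, float]]:
    """Rank scenes by Jaccard overlap between query and caption words."""
    query_words = tokenize(query)
    scored = []
    for scene in scenes:
        caption_words = tokenize(scene.caption)
        union = query_words | caption_words
        score = len(query_words & caption_words) / len(union) if union else 0.0
        if score > 0:
            scored.append((scene, score))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

# Made-up example index; real captions would come from the captioning model.
EXAMPLE_INDEX = [
    Scene("holiday.mp4", 12.0, "a dog running on the beach"),
    Scene("holiday.mp4", 47.5, "a group of people sitting at a table"),
    Scene("city.mp4", 3.0, "a man riding a bike on a street"),
]

if __name__ == "__main__":
    for scene, score in search_scenes("dog on beach", EXAMPLE_INDEX):
        print(f"{scene.video} @ {scene.start_sec}s  score={score:.3f}  {scene.caption}")
```

Word overlap is used here only because it is easy to follow; because each result carries the video name and a timestamp, the same index supports searching across multiple videos and jumping to the matching scene.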