kakshak07/Image-Captioining
The objective is to generate a textual description of an image, based on the objects and actions it contains. Generative models are used so that the system can produce novel sentences. Pipeline-type models use two separate learning processes, one for language modelling and one for image recognition. The image is first processed by the Inception-v3 model to identify its contents; the result is converted into an embedding vector and then passed to a series of LSTM cells to produce the desired caption.
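The decoding half of such a pipeline can be sketched in plain Python. This is only an illustration of the data flow, not the repository's implementation: it assumes the image has already been turned into a feature vector by a CNN such as Inception-v3 and projected to the embedding size, and every name here (`TinyLSTMCell`, `greedy_caption`, the toy vocabulary, the random weights) is hypothetical. The weights are untrained, so the words it emits are meaningless.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyLSTMCell:
    """A single LSTM cell with random (untrained) weights and no biases."""

    def __init__(self, input_size, hidden_size, rng):
        self.hidden_size = hidden_size
        # One weight matrix per gate, applied to the concatenation [x; h].
        self.W = {g: rand_matrix(hidden_size, input_size + hidden_size, rng)
                  for g in "ifog"}

    def step(self, x, h, c):
        z = x + h  # list concatenation of input and previous hidden state
        i = [sigmoid(a) for a in matvec(self.W["i"], z)]    # input gate
        f = [sigmoid(a) for a in matvec(self.W["f"], z)]    # forget gate
        o = [sigmoid(a) for a in matvec(self.W["o"], z)]    # output gate
        g = [math.tanh(a) for a in matvec(self.W["g"], z)]  # candidate state
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new


def greedy_caption(image_features, vocab, cell, embed, W_out, max_len=10):
    """Feed the image features first, then pick the top-scoring word each step."""
    h = [0.0] * cell.hidden_size
    c = [0.0] * cell.hidden_size
    # Assumption: the image vector is used as the first input to the LSTM.
    h, c = cell.step(image_features, h, c)
    token = vocab.index("<start>")
    words = []
    for _ in range(max_len):
        h, c = cell.step(embed[token], h, c)
        scores = matvec(W_out, h)
        token = max(range(len(scores)), key=scores.__getitem__)
        if vocab[token] == "<end>":
            break
        words.append(vocab[token])
    return words


# Toy setup: all sizes and values below are made up for illustration.
rng = random.Random(0)
vocab = ["<start>", "<end>", "a", "dog", "runs"]
embed_size, hidden_size = 4, 6
cell = TinyLSTMCell(embed_size, hidden_size, rng)
embed = rand_matrix(len(vocab), embed_size, rng)   # one embedding per word
W_out = rand_matrix(len(vocab), hidden_size, rng)  # hidden state -> word scores
image_features = [rng.uniform(-1.0, 1.0) for _ in range(embed_size)]

caption = greedy_caption(image_features, vocab, cell, embed, W_out, max_len=5)
print(caption)
```

In a trained system the embedding table, LSTM weights, and output projection would be learned from image-caption pairs, and greedy selection is often replaced by beam search.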
Python · MIT
Stargazers
- 974714379
- aditya120899
- adityamishra1999
- AKA-th
- AshwiniMahe
- chaitanyadave23 (VIT University)
- daskjfmen
- fleece30 (Associate Software Engineer @ Novartis)
- gzzyyxh (Fudan University)
- Hanzs6
- hcji (Agricultural Genomics Institute at Shenzhen-CAAS)
- hemantsinghsikarwar
- Hussain008
- HZM001
- kakshak07 (India)
- nath9777 (Los Angeles, CA)
- Nbeel89
- NileshSikri
- philwilge
- Piyush-Kumar99 (Cloudera)
- ReihanehGhasemianRoshan
- sailfish009 (freelancer)
- TejasNaik1910
- TheDescendant39
- Vincy2King
- wenhuixiong
- wlxiang1