Vision-Voice

Image captioning with dictation: uses InceptionV3, an LSTM, and a speech synthesis library to generate a description of an image's contents and read that description aloud.
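The captioning step in a pipeline like this is usually a word-by-word decoding loop, which can be sketched as below. This is a minimal illustration under stated assumptions, not the repository's actual code: `greedy_caption`, `toy_decoder`, and the `<start>`/`<end>` tokens are hypothetical names, and the toy decoder stands in for a trained LSTM that would be conditioned on InceptionV3 image features (the 2048-value vector here mimics the size of InceptionV3's pooled output).

```python
from typing import Callable, Dict, List

# Hypothetical sentinel tokens; the real project may use different ones.
START, END = "<start>", "<end>"


def greedy_caption(
    image_features: List[float],
    next_word_probs: Callable[[List[float], List[str]], Dict[str, float]],
    max_len: int = 20,
) -> str:
    """Build a caption by repeatedly picking the most probable next word."""
    words = [START]
    for _ in range(max_len):
        probs = next_word_probs(image_features, words)
        best = max(probs, key=probs.get)
        if best == END:
            break
        words.append(best)
    # Drop the start token before returning the caption text.
    return " ".join(words[1:])


def toy_decoder(features: List[float], words: List[str]) -> Dict[str, float]:
    """Stand-in for a trained LSTM decoder (assumption): follows a fixed script."""
    script = ["a", "dog", "on", "grass", END]
    target = script[min(len(words) - 1, len(script) - 1)]
    return {w: (0.9 if w == target else 0.02) for w in set(script)}


caption = greedy_caption([0.0] * 2048, toy_decoder)
print(caption)
```

In a full pipeline, the resulting caption string would then be handed to the speech synthesis step for dictation. Greedy decoding is used here only because it is the simplest option; beam search is a common alternative.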

Primary Language: HTML

Vision-Voice
