In this project, using the camera a scene is captured and then using deep learning model the scene caption is generated in text form, which is then converted to speech using google text to speech translation.
apoorvjain25/Image-description-for-Blind
In this project, using the camera a scene is captured and then using deep learning model the scene caption is generated in text form, which is then converted to speech using google text to speech translation.
PythonMIT