/Alpha-Co-Vision

A real-time video-caption-to-conversation bot that captures frames, generates captions for them, and uses a large language model to turn those captions into conversational responses, producing interactive video descriptions.
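The three stages named above (capture frames, caption them, respond conversationally) can be sketched as a small loop. This is only an illustrative outline, not the project's actual code: `run_pipeline`, `fake_caption`, `fake_respond`, and the `every_n` parameter are hypothetical names, the captioner and language model are replaced by stand-in functions, and skipping repeated captions is an assumption made here to keep the output readable.

```python
from typing import Callable, Iterable, Iterator, List, Optional, Tuple


def run_pipeline(
    frames: Iterable[object],
    caption_fn: Callable[[object], str],
    respond_fn: Callable[[str, List[str]], str],
    every_n: int = 1,
) -> Iterator[Tuple[str, str]]:
    """Yield (caption, response) pairs for sampled frames.

    frames     -- any iterable of frames (e.g. images read from a camera)
    caption_fn -- hypothetical stand-in for an image-captioning model
    respond_fn -- hypothetical stand-in for a large language model call;
                  receives the new caption and the earlier captions
    every_n    -- process only every n-th frame to limit the workload
    """
    history: List[str] = []
    last_caption: Optional[str] = None
    for index, frame in enumerate(frames):
        if index % every_n != 0:
            continue  # frame not sampled
        caption = caption_fn(frame)
        if caption == last_caption:
            continue  # assumption: do not respond again to an unchanged scene
        last_caption = caption
        response = respond_fn(caption, history)
        history.append(caption)
        yield caption, response


# Stand-ins so the sketch runs without a camera or any model.
def fake_caption(frame: object) -> str:
    return f"a frame showing {frame}"


def fake_respond(caption: str, history: List[str]) -> str:
    return f"I see {caption} (caption #{len(history) + 1})."


if __name__ == "__main__":
    for caption, response in run_pipeline(["cat", "cat", "dog"], fake_caption, fake_respond):
        print(caption, "->", response)
```

Passing the captioner and the responder in as functions keeps the loop independent of any particular model, so either stage could be swapped without touching the rest.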

Primary language: Python
License: MIT
