video-conversation

There are 3 repositories under video-conversation topic.

  • mbzuai-oryx/Video-ChatGPT

    [ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

    Language:Python1.2k15121107
  • mbzuai-oryx/Video-LLaVA

    PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

    Language:Python241141611
  • mbzuai-oryx/VideoGPT-plus

    Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

    Language:Python21352614