long-video-understanding

There are 7 repositories under the long-video-understanding topic.

  • rese1f/MovieChat

    [CVPR 2024] 🎬💭 Chat with over 10K frames of video!

    Language: Python
  • XLearning-SCU/2024-ICLR-Norton

    Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]

    Language: Python
  • RenShuhuai-Andy/TESTA

    [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

    Language: Python
  • zjr2000/LLMVA-GEBC

    Winning solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2023 workshop)

    Language: Python
  • zjr2000/GVL

    Official implementation of the paper "Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos"

    Language: Python
  • kkahatapitiya/LangRepo

    Language Repository for Long Video Understanding

    Language: Python
  • SCZwangxiao/DEPICT

    A multi-modal video caption dataset with richer annotations

    Language: Python