DAMO-NLP-SG/Video-LLaMA

Evaluation on large-scale dataset

hritam-98 opened this issue · 1 comments

Hello,
Thank you for your amazing work. The demo runs fine for a single video.

I'm curious if there are any provisions for generating inference on a larger dataset of videos, each accompanied by corresponding text questions. Additionally, I'm interested to know if there's an API available for this purpose.

Looking forward to your insights on this matter.

I need this too! Do u know how to evaluate on large-scale dataset now?