PKU-YuanGroup/Video-LLaVA

Is there any way to speed up the inference


Is there any way to speed up inference other than lowering the number of frames? Does reducing the video resolution speed it up?

I have the same problem. On a single RTX 4090, inference takes about 3 seconds per sample.

@Coronal-Halo the videos are internally resized to 224x224, so don't expect lowering the input resolution to have a (noticeable) impact.
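Precision, on the other hand, is a lever that does help. Here's a minimal sketch of fp16 inference using the Hugging Face port of the model (`LanguageBind/Video-LLaVA-7B-hf`) — note this assumes a `transformers` version with VideoLlava support, which may postdate this thread; 4/8-bit quantization via a `BitsAndBytesConfig` is a further option:

```python
import numpy as np
import torch
from transformers import VideoLlavaProcessor, VideoLlavaForConditionalGeneration

model_id = "LanguageBind/Video-LLaVA-7B-hf"

# fp16 roughly halves memory traffic vs fp32; quantization trades a
# little quality for additional VRAM/latency savings.
model = VideoLlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16
).to("cuda").eval()
processor = VideoLlavaProcessor.from_pretrained(model_id)

# Stand-in clip: 8 frames, HWC uint8 (replace with real decoded frames).
video = np.random.randint(0, 256, (8, 224, 224, 3), dtype=np.uint8)
prompt = "USER: <video>\nWhat is happening in this video? ASSISTANT:"

inputs = processor(text=prompt, videos=video, return_tensors="pt")
inputs = inputs.to("cuda", torch.float16)  # casts only the float tensors

with torch.inference_mode():  # skips autograd bookkeeping
    out = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(processor.batch_decode(out, skip_special_tokens=True)[0])
```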

@cm-xcju 3 seconds for one video is actually not bad at all: under the hood you're essentially making 8 LLaVA calls (one per sampled frame), except that they're batched together.
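To make the "batched frames" point concrete, here's a shapes-only sketch: the T sampled frames are folded into the batch dimension before the vision tower, so one video costs roughly one batched image forward pass rather than 8 sequential ones. (The real encoder is LanguageBind's ViT; a linear layer stands in for it here.)

```python
import torch

B, T, C, H, W = 1, 8, 3, 224, 224
frames = torch.randn(B, T, C, H, W)

encoder = torch.nn.Linear(C * H * W, 1024)  # stand-in for the ViT
flat = frames.view(B * T, C * H * W)        # fold frames into the batch dim
features = encoder(flat).view(B, T, -1)     # unfold back to per-frame features
print(features.shape)  # torch.Size([1, 8, 1024])
```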

Is there a way to perform batch inference on a batch of videos?

> Is there a way to perform batch inference on a batch of videos?

If you use the search bar, you can find this issue: #40 (comment)
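For reference, batching multiple videos also works with the Hugging Face port mentioned above (again an assumption that may postdate this thread, and the processor's handling of lists of clips is assumed here): pad the prompts on the left and pass a list of clips.

```python
import numpy as np
import torch
from transformers import VideoLlavaProcessor, VideoLlavaForConditionalGeneration

model_id = "LanguageBind/Video-LLaVA-7B-hf"
model = VideoLlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16
).to("cuda").eval()
processor = VideoLlavaProcessor.from_pretrained(model_id)
processor.tokenizer.padding_side = "left"  # required for batched generate()

# Two stand-in clips of 8 frames each (replace with real decoded frames).
videos = [
    np.random.randint(0, 256, (8, 224, 224, 3), dtype=np.uint8)
    for _ in range(2)
]
prompts = [
    "USER: <video>\nDescribe the video. ASSISTANT:",
    "USER: <video>\nWhat happens at the end? ASSISTANT:",
]

inputs = processor(text=prompts, videos=videos, padding=True, return_tensors="pt")
inputs = inputs.to("cuda", torch.float16)

with torch.inference_mode():
    out = model.generate(**inputs, max_new_tokens=64, do_sample=False)
for answer in processor.batch_decode(out, skip_special_tokens=True):
    print(answer)
```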