WingsBrokenAngel/delving-deeper-into-the-decoder-for-video-captioning

Inference on single video

nikky4D opened this issue · 3 comments

Hi,

Do you have a demo.py/ipynb that I can use to run inference on a single video and see the generated captions? If not, can you describe how I can set this up?

Thanks

  1. Encoder part: use ResNeXt, ECO, and the Semantic Detection Network to extract features from a video clip.
  2. Decoder part: feed those features into the captioning model as inputs (see the sketch after this list).
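
A minimal, hypothetical sketch of what single-video inference could look like under those two steps. The extractor callables and the decoder's `next_word` method are placeholders, not this repository's actual API; they stand in for the ResNeXt/ECO/semantic networks and the trained captioning model.

```python
import numpy as np

def caption_video(frames, extractors, decoder, max_len=20):
    """frames: sampled video frames; extractors: dict of feature functions;
    decoder: trained captioning model with an assumed step-wise `next_word` method."""
    # 1) Encoder part: clip-level appearance, motion, and semantic features.
    feats = np.concatenate([
        extractors["resnext"](frames),    # e.g. 2048-d appearance vector
        extractors["eco"](frames),        # e.g. 1536-d motion vector
        extractors["semantic"](frames),   # e.g. semantic tag probabilities
    ])
    # 2) Decoder part: greedy decoding conditioned on the fused features.
    words = []
    for _ in range(max_len):
        word = decoder.next_word(feats, words)  # hypothetical step API
        if word == "<EOS>":
            break
        words.append(word)
    return " ".join(words)
```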

@WingsBrokenAngel
Can you please provide code/repo links on how to go about feature extraction for the encoder part?

ResNeXt can be found in tensornets, and ECO can be found in ECO-efficient-video-understanding.
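
For the ResNeXt side, a minimal sketch of per-frame feature extraction with tensornets (TensorFlow 1.x style) might look like the following. The `ResNeXt101` variant, the `stem=True` option, the 224x224 input size, and the global-average-pooling step are assumptions; check the tensornets documentation and the paper for the exact settings used.

```python
import tensorflow as tf
import tensornets as nets

# Placeholder for a batch of cropped frames.
inputs = tf.placeholder(tf.float32, [None, 224, 224, 3])
# stem=True is assumed to return the convolutional feature map instead of logits.
model = nets.ResNeXt101(inputs, is_training=False, stem=True)
# Global average pool over the spatial dims -> one feature vector per frame.
features = tf.reduce_mean(model, axis=[1, 2])

with tf.Session() as sess:
    sess.run(model.pretrained())  # load the ImageNet-pretrained weights
    # 'frame_000.jpg' is a hypothetical path to one sampled video frame.
    imgs = nets.utils.load_img('frame_000.jpg', target_size=256, crop_size=224)
    feats = sess.run(features, {inputs: model.preprocess(imgs)})
    print(feats.shape)  # expected to be (1, feature_dim)
```

Running this over uniformly sampled frames of a clip and averaging (or stacking) the resulting vectors would give the appearance feature consumed by the decoder; the ECO motion features and the semantic tags would be produced analogously with their respective repositories.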