Demo predict the video or capture

Question

KennyChen880127 opened this issue 2 years ago · 3 comments

Hello everyone! Does anyone know how to run prediction on a video (mp4) or a camera capture? This issue is very important to me.

I am adapting this code: https://github.com/mhamilton723/STEGO/blob/master/src/STEGO_Colab_Demo.ipynb

My idea is to use torchvision.io.read_video(video), which returns a tuple, and then convert the result to a tensor. But it does not work. I hope someone can help me!

Answer 1 · 2023-02-02T13:46:03.000Z

@KennyChen880127 yes that would indeed be the first step.

In particular:

1. Load the video with torchvision
2. Transform each video frame
3. Apply STEGO to each video frame
4. Apply the plotting code to each frame. For guidance, use the matplotlib video maker I have in https://github.com/mhamilton723/STEGO/blob/master/src/plot_dino_correspondence.py

If you make a nice video tool, happy to accept in a PR
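
As a rough illustration of steps 1 to 3 above, here is a minimal sketch, not the repository's own code. It assumes the frames come from torchvision.io.read_video, which returns a (video, audio, info) tuple whose first element is a uint8 tensor of shape (T, H, W, C). The resize size, the ImageNet normalization constants, and the segment_fn callable are assumptions for illustration; segment_fn stands in for whatever wraps the STEGO model and returns per-pixel class scores. Plotting (step 4) is left out.

```python
import torch
import torch.nn.functional as F

# ImageNet statistics; an assumption about what the checkpoint expects.
MEAN = torch.tensor([0.485, 0.456, 0.406]).view(1, 3, 1, 1)
STD = torch.tensor([0.229, 0.224, 0.225]).view(1, 3, 1, 1)


def load_frames(path):
    """Step 1: read a video file into a uint8 tensor of shape (T, H, W, C)."""
    # Imported here so the other functions also work on frames that come
    # from another source, such as a camera capture.
    from torchvision.io import read_video
    frames, _audio, _info = read_video(path, pts_unit="sec")
    return frames


def preprocess(frames, size=224):
    """Step 2: (T, H, W, C) uint8 -> (T, C, size, size) normalized float."""
    x = frames.permute(0, 3, 1, 2).float() / 255.0
    x = F.interpolate(x, size=(size, size), mode="bilinear", align_corners=False)
    return (x - MEAN) / STD


@torch.no_grad()
def segment_video(frames, segment_fn, batch_size=8, size=224, device="cpu"):
    """Step 3: run segment_fn over the frames in batches.

    segment_fn is a hypothetical callable that maps a (B, 3, H, W) float
    batch to per-pixel class scores of shape (B, K, H', W').
    Returns an int64 label map of shape (T, H', W').
    """
    labels = []
    for start in range(0, frames.shape[0], batch_size):
        batch = preprocess(frames[start:start + batch_size], size).to(device)
        scores = segment_fn(batch)
        # The most likely class per pixel, moved back to the CPU.
        labels.append(scores.argmax(dim=1).cpu())
    return torch.cat(labels, dim=0)
```

With this layout the model would be built once by the caller, put in eval mode, and passed in through segment_fn, so each frame only costs a forward pass. Something like segment_video(load_frames("input.mp4"), my_segment_fn) would then give one label map per frame, where input.mp4 and my_segment_fn are placeholders.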

Answer 2 · 2023-02-05T08:59:46.000Z

@mhamilton723 Thank you for your reply! I will refer to your suggestion!

Answer 3 · 2023-02-15T08:10:34.000Z

I'm sorry to bother you again, sir. I followed your steps and succeeded in running prediction on the video, but I found that the fps is very low. I guess this is because LitUnsupervisedSegmenter is used again for every frame when predicting? Could you show me an easy way to run the prediction? I really need this code...