optical flow images

Question

Opened this issue 7 years ago · 8 comments

Can you tell me how you are feeding in the optical flow images that you computed from the video frames?
I am a bit confused.

Answer 1 · 2018-06-25T03:37:57.000Z

Hi,
Basically, we compute the optical flow between every two adjacent frames and then stack the results together to form the optical flow for the whole sequence. So when the number of frames is 10, we will have 9 optical flow images, each with shape (im_size x im_size x 2), and the stacked optical flow image will have shape (im_size x im_size x 18).
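To make the shapes concrete, here is a minimal NumPy sketch of the stacking step. It assumes the per-pair flow fields have already been computed by some optical flow routine; the name stack_flows, im_size = 224, and the random arrays standing in for real flows are all hypothetical and not taken from the repository.

```python
import numpy as np

def stack_flows(flows):
    """Concatenate per-pair flow fields of shape (H, W, 2) along the channel axis.

    The resulting channel order is [dx_1, dy_1, dx_2, dy_2, ...].
    """
    flows = [np.asarray(f, dtype=np.float32) for f in flows]
    if not flows:
        raise ValueError("need at least one flow field")
    for f in flows:
        if f.ndim != 3 or f.shape[2] != 2 or f.shape != flows[0].shape:
            raise ValueError("every flow field must have the same (H, W, 2) shape")
    return np.concatenate(flows, axis=2)

# Hypothetical example: 10 frames give 9 flow fields between adjacent frames.
im_size = 224      # assumed image size, not taken from the repository
num_frames = 10
rng = np.random.default_rng(0)
flows = [rng.standard_normal((im_size, im_size, 2)).astype(np.float32)
         for _ in range(num_frames - 1)]

stacked = stack_flows(flows)
print(stacked.shape)  # (224, 224, 18)
```

With this layout, channels 2k and 2k+1 hold the horizontal and vertical flow between frame k and frame k+1.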

Answer 2 · 2018-06-25T04:08:15.000Z

How are you cropping? If you could just tell me whether this is right or wrong, it would be a great help.
For running, I need to crop the human and resize the crop so that in each frame the person appears to stay at one point, moving only their hands and legs.

For bending, jumping, pulling, and handshake, I need to use the same boundary for all the frames where the motion is taking place, right?

Answer 3 · 2018-06-26T02:25:00.000Z

I think the frames in your screenshot are good :). After you crop the human and resize all frames to the same size, you can compute the stacked optical flow.
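As a rough illustration of the crop-and-resize step, here is a small NumPy sketch. It assumes you already have one (x0, y0, x1, y1) box per frame from some person detector or manual annotation. The helper names, the box values, and the output size of 64 are hypothetical, and nearest-neighbour resizing is used only to keep the example self-contained.

```python
import numpy as np

def union_box(boxes):
    """Smallest (x0, y0, x1, y1) box that contains every per-frame box."""
    boxes = np.asarray(boxes)
    return (int(boxes[:, 0].min()), int(boxes[:, 1].min()),
            int(boxes[:, 2].max()), int(boxes[:, 3].max()))

def crop_and_resize(frame, box, out_size):
    """Crop `frame` to `box`, then nearest-neighbour resize to out_size x out_size."""
    x0, y0, x1, y1 = box
    crop = frame[y0:y1, x0:x1]
    h, w = crop.shape[:2]
    # Map each output row/column back to a source row/column.
    rows = np.arange(out_size) * h // out_size
    cols = np.arange(out_size) * w // out_size
    return crop[rows[:, None], cols]

# Hypothetical data: two 120x160 RGB frames with one box per frame.
frames = [np.zeros((120, 160, 3), dtype=np.uint8) for _ in range(2)]
boxes = [(40, 20, 80, 100), (50, 25, 90, 105)]

shared = union_box(boxes)  # same boundary for all frames
crops = [crop_and_resize(f, shared, 64) for f in frames]
print(shared, crops[0].shape)  # (40, 20, 90, 105) (64, 64, 3)
```

For actions where the person moves across the scene, the same crop_and_resize helper could be called with each frame's own box instead of the shared one.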

Answer 4 · 2018-06-27T06:26:59.000Z

Can I implement a two-stream convolutional neural network on multi-view camera video datasets?
The dataset I am using (I3DPost) has actions captured from 8 different angles.
Sorry to ask such a silly question. I am a newbie in this field.

Answer 5 · 2018-06-27T06:30:32.000Z

The camera is fixed, but the subject performs the action while facing 8 different directions (facing the camera, facing away from the camera, etc.).

Answer 6 · 2018-06-29T22:56:31.000Z

I think the two-stream model assumes the input optical flow comes from a single video sequence. It's not a good idea to stack flows extracted from 8 videos taken from different angles, since together they cannot be considered one complete sequence.
If you want to use the model on multi-view data, I guess a better way may be to train one model per view (assuming you know which sequence was taken from which view in the dataset), and then use something like a voting mechanism to give the final prediction.
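A voting mechanism could look roughly like the sketch below. It assumes each per-view model outputs a vector of class probabilities for the same action clip; the function names and the numbers are made up for illustration. Two common variants are shown: hard majority voting and averaging the probabilities.

```python
import numpy as np

def majority_vote(view_probs):
    """Each view votes for its top class; ties go to the lowest class index."""
    view_probs = np.asarray(view_probs, dtype=np.float64)
    votes = view_probs.argmax(axis=1)
    counts = np.bincount(votes, minlength=view_probs.shape[1])
    return int(counts.argmax())

def average_vote(view_probs):
    """Average the per-view probabilities, then pick the top class."""
    view_probs = np.asarray(view_probs, dtype=np.float64)
    return int(view_probs.mean(axis=0).argmax())

# Hypothetical outputs of three per-view models over three action classes.
probs = [
    [0.6, 0.3, 0.1],
    [0.2, 0.7, 0.1],
    [0.5, 0.4, 0.1],
]

print(majority_vote(probs))  # 0
print(average_vote(probs))   # 1
```

The two variants can disagree, as in this example: two views narrowly prefer class 0, while one view is confident about class 1. Which one works better would need to be checked on a validation set.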

Answer 7 · 2021-01-16T16:58:55.000Z

Hi,
Basically, we compute the optical flow between every two adjacent frames and then stack the results together to form the optical flow for the whole sequence. So when the number of frames is 10, we will have 9 optical flow images, each with shape (im_size x im_size x 2), and the stacked optical flow image will have shape (im_size x im_size x 18).

Hi, may I know the exact mechanism you used for stacking? I have generated the images, but I am not sure how to stack them and input them into the motion model.
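One possible arrangement, offered as a sketch rather than the repository's confirmed method: if the flow was saved as separate 8-bit grayscale images for the horizontal (x) and vertical (y) components, the pairs can be interleaved along the channel axis and given a batch dimension. The function name, the 8-bit encoding with mid-gray as zero motion, and the rescaling are all assumptions. The images are passed in as arrays, so loading them from disk is left to whatever image library you use.

```python
import numpy as np

def stack_flow_images(x_images, y_images):
    """Interleave x/y flow images into one (H, W, 2 * L) float32 array.

    Assumes 8-bit images where mid-gray is roughly zero motion, so values
    are rescaled from [0, 255] to [-0.5, 0.5].
    """
    if len(x_images) == 0 or len(x_images) != len(y_images):
        raise ValueError("need the same non-zero number of x and y images")
    channels = []
    for fx, fy in zip(x_images, y_images):
        channels.append(np.asarray(fx, dtype=np.float32))
        channels.append(np.asarray(fy, dtype=np.float32))
    stacked = np.stack(channels, axis=-1)   # (H, W, 2 * L)
    return stacked / 255.0 - 0.5

# Hypothetical data: 9 flow pairs from a 10-frame clip, tiny 4x4 images.
x_images = [np.full((4, 4), 255, dtype=np.uint8) for _ in range(9)]
y_images = [np.zeros((4, 4), dtype=np.uint8) for _ in range(9)]

stacked = stack_flow_images(x_images, y_images)
batch = stacked[np.newaxis]                 # add a batch axis
print(stacked.shape, batch.shape)  # (4, 4, 18) (1, 4, 4, 18)
```

If the motion model expects channels first, batch.transpose(0, 3, 1, 2) would give shape (1, 18, 4, 4).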

Answer 8 · 2024-01-24T18:49:50.000Z

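Thank you for your message! If you need anything, please visit my new blog: [Star] http://my.oschina.net/wuao/blog. An undying star: when dreams collide with reality, please don't give up lightly!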