zczcwh/PoseFormer

Inference on in-the-wild video gives wrong results

Opened this issue · 3 comments

Hi,

I used HRNet to generate a 2D keypoint npz file following the VideoPose3D format.

I slightly modified your code to run inference on in-the-wild video, following VideoPose3D, but got the wrong result shown below:

image

The result is completely tangled.

Can you point out my mistake?

Thanks

Would you please share the modified code?

I get results similar to yours.

Sorry that I can't solve your problem, but can you tell me exactly how you got this video? Should I install a 2D detector and use its output as the input to PoseFormer?