GuyTevet/MotionCLIP

issue about rendering the sample

sygyq305 opened this issue · 13 comments

In visualize.py, the parameter "interval" is set to 1000/fps. May I know why it is set this way by default, and why the constant is 1000?

I think it is because the units of interval are milliseconds [msec], so 1000/fps is the delay between consecutive frames.
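To illustrate, here is a minimal sketch of how such an interval is typically passed to matplotlib's FuncAnimation (a generic example, not necessarily the exact code in visualize.py):

```python
import matplotlib.pyplot as plt
from matplotlib.animation import FuncAnimation

fps = 20                  # frames per second of the rendered clip
interval_ms = 1000 / fps  # FuncAnimation expects the inter-frame delay in milliseconds

fig, ax = plt.subplots()

def update(frame_idx):
    # placeholder: draw the pose for this frame here
    ax.set_title(f"frame {frame_idx}")

# interval = 50 ms for fps = 20, so the animation plays at the intended rate
anim = FuncAnimation(fig, update, frames=60, interval=interval_ms)
anim.save("sample.mp4", fps=fps)
```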

Thanks.
I have a confusion about the rendering duration. In the default, the fps is 20. In the paper-model's param, 'num_frames' is 60. So all rendered sample durations are 3 seconds. What should I do if I want to render 6-second-sample.
I once tried to change 'num_frames' from 60 to 120. Although the total duration has been to 6 seconds, it only adds 3 seconds to the rendering time. The first three seconds are the same as before, and the last three seconds are stationary.

This model was trained for motions with a fixed length of 60 frames. Please try retraining it for 120 frames.
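To make the arithmetic concrete (a minimal sketch; the variable names are illustrative, not the repo's actual parameters):

```python
fps = 20          # rendering frame rate
num_frames = 60   # fixed motion length the paper model was trained on

duration_sec = num_frames / fps   # 60 / 20 = 3 seconds

# Raising num_frames to 120 stretches the clip to 120 / 20 = 6 seconds,
# but the model still only generates 60 meaningful frames, so the last
# 3 seconds appear frozen unless the model is retrained for 120 frames.
```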

Thanks.
I have another confusion about the rendering. All rendered motions used paper-model can appear the rendered frames and their text description used during training. Just like Figure 4 in the paper.

Are you asking about the appearance? If so, they all have the same appearance as in Fig. 4:
https://drive.google.com/file/d/1F8VLY4AC2XPaV3DqKZefQJNWn4KY2z_c/view?usp=sharing

No.
My question is that Fig. 4 shows training-phase frames. Does the inference phase also produce such frames?

At inference, you do text-to-motion, i.e. encode the text and decode the motion, so the rendered frames are unnecessary.
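A rough, hypothetical sketch of this text-to-motion data flow (the CLIP calls are real, but the decoder below is a stand-in placeholder, not MotionCLIP's actual module or API):

```python
import torch
import torch.nn as nn
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"

# 1) Encode the text prompt into CLIP's shared latent space (real CLIP API)
clip_model, _ = clip.load("ViT-B/32", device=device)
tokens = clip.tokenize(["a person jumps 360 degrees to the left"]).to(device)
with torch.no_grad():
    text_latent = clip_model.encode_text(tokens).float()  # shape: (1, 512)

# 2) Decode the latent into a motion sequence. This linear layer is only a
#    placeholder to show the data flow; the repo uses a trained transformer
#    decoder instead.
num_frames, num_joints, feat_dim = 60, 24, 6
decoder = nn.Linear(512, num_frames * num_joints * feat_dim).to(device)
motion = decoder(text_latent).view(1, num_frames, num_joints, feat_dim)

# No per-frame reference images or captions are needed here: the text latent
# alone drives the whole 60-frame sequence, which is then rendered.
```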

What can I do if I want to see more details about the inference frames?

Do you mean that you want to render the results with a more elaborate body model such as SMPL, instead of the stick figures?

No, I mean that I want to know how the frames are allocated to each action.
For example, if the input text is '360 degree left jump and standing and turning back', there are three actions: jump, stand, and turn back. How are the frames allocated among jump, stand, and turn back?

Got you. The frames are not explicitly allocated by the user, but by the model. If you want to interpret the model's decisions, you can try adapting transformer interpretability papers to the motion domain. In any case, that isn't a trivial task.

Thanks.
And which papers are the transformer interpretability papers you mentioned?

Sorry, I'm not familiar enough with this field.