MFaceTech/AnimateAnyone-SVD

training

Opened this issue · 4 comments

Do you only train controlnet?

Yes, for the v1 pretrained checkpoint we only train the ControlNet; you can also train the temporal layers with a sufficient amount of data if needed.

@MFaceTech Thanks for your reply. I notice you use the first frame as both the reference image and the condition image latents, which does not fit long-range video inference. Have you tested long videos? Why not just randomly choose one frame from the whole video as the reference image?
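The sampling strategy suggested above could be sketched roughly as follows. This is a hypothetical helper, not code from the repo; the function name and parameters are assumptions for illustration only.

```python
import random

def sample_ref_and_clip(num_frames, clip_len, seed=None):
    """Hypothetical sampler: pick a random reference frame from the
    whole video plus a contiguous training clip, instead of always
    using frame 0 as the reference."""
    rng = random.Random(seed)
    # Any frame in the video may serve as the reference image.
    ref_idx = rng.randrange(num_frames)
    # Sample a contiguous clip of target frames for the denoiser.
    start = rng.randrange(num_frames - clip_len + 1)
    clip = list(range(start, start + clip_len))
    return ref_idx, clip
```

Decoupling the reference frame from the clip start this way means the model sees reference/target pairs that are far apart in time, which should reduce the bias toward the first frame.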

@jiangzhengkai This is a good observation, and we have also noticed that using the first frame as a reference image is not favorable for long-range video inference. This issue has been addressed in v1.1, and we will revise the code and release the models soon.

@MFaceTech Could you briefly explain, from a technical standpoint, how long-range video inference is enabled?