huanngzh/threestudio-dreamwaltz

[misc] support High-Resolution Canonical Avatars && image-to-avatars

Closed this issue · 3 comments

Hello, great job on your work! I have a couple of questions regarding it:

  1. Can the current codebase support High-Resolution Canonical Avatars? What is its computational overhead compared to 64x64 avatars?

  2. Can the current task accommodate image-to-avatar conversion?

  1. I check the improvement of DreamWaltz about resolution. I think it can be easily reproduced in the current codebase by changing the resolution. I'm not sure about the specific computing resource consumption, and I haven't reproduced it yet :)
  2. Maybe you can introduce some methods like zero123, IP-Adapter as guidance?

I think you need to adjust some parameters such as guidance_scale, weighting_strategy etc. In addition, in my code, there is a parameter guidance_eval in the config, which indicates how many steps to verify the image inference results. You can judge whether lora works based on this.