How to remove text information in denoising UNet based on the existing code?
Opened this issue · 1 comments
RuiTianHIT commented
Dear author!
We are interested in your high-quality and excellent work.
We want to explore the ability of depth estimation when the model does not introduce text information. As shown in the image.
However, we forced self.conditioning key == None, and an error occurred during this process.
Does the author have any good solution?
Thank you very much, I look forward to your reply!
wl-zhao commented
You can just feed a zero text embedding to the UNet