Question about the high-resolution pixel decoder

Question

Question about the high-resolution pixel decoder

Closed this issue 10 months ago · 2 comments

Hi,

Very insightful work!
A question is about the details of the new high-resolution pixel decoder, which supports to generate high resolution, muliple aspect ratios, and high aesthetics images.
Could you please release some details of the training process?
Thanks a lot!

Best regards

Answer 1 · 2023-11-19T12:29:55.000Z

The high-resolution pixel decoder is trained with the same strategy as the original one. Given an input image, it takes the discrete the visual token ID tokenized by our visual tokenizer as condition, and aims to recover the original input.

Answer 2 · 2023-11-20T08:50:31.000Z

OK, thanks for your reply! jy0205 ***@***.***> 于2023年11月19日周日 20:30写道：

…

The high-resolution pixel decoder is trained with the same strategy as the original one. Given an input image, it takes the discrete the visual token ID tokenized by our visual tokenizer as condition, and aims to recover the original input. — Reply to this email directly, view it on GitHub <#6 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AR4KH4MRGFNNHKEIEWP4U7DYFH3U3AVCNFSM6AAAAAA7Q75XDSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQMJXHA2DANRSG4> . You are receiving this because you authored the thread.Message ID: ***@***.***>