vqgan result

Question

vqgan result

Winnie202 opened this issue 2 years ago · 2 comments

Thank you for sharing the code,I used taming-tranformer to did the image reconstruction for Street View,but smaller text sections don't work well. If i use this model to train this type of dataset can optimize the reconstruction results of vqgan with small text,like these:

Answer 1 · 2023-05-10T14:22:03.000Z

Hi @Winnie202, awesome results!! I'm glad you could reconstruct the small texts. We could try generating synthetic scene-text as a follow-up

Answer 2 · 2023-05-11T01:38:00.000Z

awesome results!! I'm glad you could reconstruct the small texts. We could try generating synthetic scene-text as a follow-up

Do you have any good suggestions for improving the reconstruction of the text in these scenes