thuanz123/enhancing-transformers

Results

snarb opened this issue · 5 comments

snarb commented

Hello. Thanks for your work.
What training results and metrics do you get on any dataset with this code?

thuanz123 commented

Hi @snarb, sorry for the late reply.

  1. For visualizing results, you can play around in this Colab link
  2. For metrics, I compare the LPIPS score (perceptual loss) and the L2 loss (log_gaussian loss), and I look at the visual quality of the images in the validation dataset between different runs (see the sketch below)
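Not from the repository itself, but a minimal sketch of how such a validation comparison could look, assuming the `lpips` pip package and a hypothetical `model` whose forward pass returns reconstructions in [-1, 1]:

```python
# Illustrative sketch (not this repo's code): comparing reconstructions on a
# validation set with LPIPS (perceptual) and L2 losses.
import torch
import lpips  # pip install lpips

lpips_fn = lpips.LPIPS(net='vgg').eval()  # perceptual distance network

@torch.no_grad()
def evaluate(model, val_loader, device='cuda'):
    lpips_fn.to(device)
    model.eval().to(device)
    lpips_sum, l2_sum, n = 0.0, 0.0, 0
    for images, _ in val_loader:
        images = images.to(device)                  # assumed to be in [-1, 1]
        recons = model(images)                      # hypothetical: returns reconstructions
        lpips_sum += lpips_fn(recons, images).sum().item()
        l2_sum += ((recons - images) ** 2).mean(dim=(1, 2, 3)).sum().item()
        n += images.size(0)
    return lpips_sum / n, l2_sum / n                # lower is better for both
```

Comparing these two averages between runs, together with a visual check of the reconstructions, is the kind of comparison described above.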
snarb commented

Thanks @thuanz123. I have found another VQ-GAN implementation: https://github.com/lucidrains/parti-pytorch/blob/main/parti_pytorch/vit_vqgan.py. Can you please tell me what the significant differences are? It looks like you have added ideas from RQ-VAE.

thuanz123 commented

  1. Lucidrains's implementation of ViT-VQGAN is quite complex, as he applies a lot of additional techniques, whereas my code uses only a plain, simple ViT architecture. Also, training ViT-VQGAN with his code is very slow compared to mine, and the image quality is not much better when I train for the same number of iterations. This may be biased, since I haven't carefully tried his code.
  2. Apart from the additional code for RQ-VAE (sketched after this list), my implementation is the closest one to the author's unreleased implementation, since I asked him a lot of questions.
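For context, the RQ-VAE addition mentioned in point 2 is residual quantization: the encoder output is quantized several times, with each round quantizing the residual left over by the previous round. The sketch below only illustrates that idea (it omits the straight-through gradient estimator and the commitment loss) and is not this repository's actual code; all names in it are made up for the example.

```python
# Illustrative sketch of residual quantization (the RQ-VAE idea), not this repo's code.
import torch
import torch.nn as nn

class ResidualQuantizer(nn.Module):
    def __init__(self, codebook_size=1024, dim=256, depth=4):
        super().__init__()
        self.depth = depth
        self.codebook = nn.Embedding(codebook_size, dim)

    def forward(self, z):                                    # z: (batch, tokens, dim)
        residual, quantized, indices = z, torch.zeros_like(z), []
        for _ in range(self.depth):
            # pick the nearest codebook entry for the current residual
            dists = torch.cdist(residual, self.codebook.weight.unsqueeze(0))  # (B, T, K)
            idx = dists.argmin(dim=-1)                       # (B, T)
            code = self.codebook(idx)                        # (B, T, dim)
            quantized = quantized + code
            residual = residual - code
            indices.append(idx)
        # the sum of the selected codes approximates z; indices stack to (B, T, depth)
        return quantized, torch.stack(indices, dim=-1)
```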

Also, a big difference from lucidrains' code is that my code supports multi-node, multi-GPU training thanks to PyTorch Lightning, while his code does not seem to support this.
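For reference, multi-node, multi-GPU training with PyTorch Lightning usually comes down to a few Trainer flags. The snippet below shows the standard Lightning arguments; the actual entry point and configuration used in this repository may differ.

```python
# Illustrative Lightning Trainer setup for multi-node, multi-GPU training
# (standard Lightning flags; not necessarily how this repo wires it up).
import pytorch_lightning as pl

trainer = pl.Trainer(
    accelerator="gpu",
    devices=4,        # GPUs per node
    num_nodes=2,      # machines participating in the run
    strategy="ddp",   # one DDP process per GPU across all nodes
)
# trainer.fit(model, train_dataloader, val_dataloader)
```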

I will close this for now. If you have any other questions, feel free to reopen.