rosinality/swapping-autoencoder-pytorch

Need for two encoders and decoders during training

krips89 opened this issue · 8 comments

What is the need for two encoders (encoder and e_ema) during training?
And the same goes for the decoder.

e_ema is a running average of the encoder's weights, and likewise for the decoder. Using a running average of the model's weights gives better results.
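For reference, here is a minimal sketch of how such an EMA copy is typically maintained in StyleGAN2-style PyTorch training loops. The function name `accumulate` and the decay value are assumptions modeled on that common pattern, not necessarily this repository's exact code:

```python
import torch

def accumulate(model_ema, model, decay=0.999):
    """Update model_ema's parameters toward model's parameters with an
    exponential moving average: ema = decay * ema + (1 - decay) * current."""
    params_ema = dict(model_ema.named_parameters())
    params = dict(model.named_parameters())
    with torch.no_grad():
        for name, param in params.items():
            params_ema[name].mul_(decay).add_(param, alpha=1 - decay)
```

This would be called once per training step, after the optimizer update; the EMA copy (e_ema / g_ema) is then the one used at evaluation time.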

Any literature to back this up?

I don't know which paper first applied EMA to GAN training, but you can refer to papers like this one: https://arxiv.org/abs/1806.04498

Thank you for the answer. Does the official stylegan2 implementation do that?
I'm closing this issue.

Yes, stylegan2 also uses it.

Hi, thanks for your implementation.
Is there any criterion to define accum (the hyper-parameter for EMA)?
Code here:
accum = 0.5 ** (32 / (10 * 1000))
@rosinality

@nobodypengium I took it from the official stylegan2 implementation. It seems the authors defined it in relation to the number of seen images (which is the stylegan2 authors' preferred way to define hyperparameters). I think 0.999 ~ 0.9999 (biggan) works well. (Though stylegan2 uses about 0.998.)
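To make the "number of seen images" convention concrete, the arithmetic behind that line appears to be a per-step decay with a half-life of 10k images at batch size 32 (the interpretation as a half-life is an inference from the formula, not stated in the repo):

```python
batch_size = 32             # images consumed per optimizer step
half_life_imgs = 10 * 1000  # EMA half-life measured in images (10 kimg)

# Per-step decay chosen so the EMA's memory of old weights halves
# every half_life_imgs images, regardless of batch size:
# accum ** (half_life_imgs / batch_size) == 0.5
accum = 0.5 ** (batch_size / half_life_imgs)
print(accum)  # ~0.99778, i.e. roughly the 0.998 mentioned above
```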

Thanks for your answer!