nawnoes/pytorch-transformer

some confusions

Opened this issue · 2 comments

hey man,
I'm confused about the training process of the ELECTRA model.
In my view, the generator should be trained first, but in your code the generator and discriminator are trained together.
Can you tell me why?
Thank you.

Hello, leileilin
As I understand it, the ELECTRA model pretrains the generator and discriminator together.
So the loss is a weighted sum of the generator loss and the discriminator loss:

# return weighted sum of losses
total_loss = self.gen_weight * gen_loss + self.disc_weight * disc_loss
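To make the weighted sum concrete, here is a minimal, self-contained sketch of how the two losses could be computed and combined in PyTorch. It is not the code from this repository: the function name `electra_joint_loss`, the tensor shapes, and the default weights (1.0 and 50.0, the values I recall from the ELECTRA paper) are assumptions for illustration only.

```python
import torch
import torch.nn.functional as F


def electra_joint_loss(gen_logits, disc_logits, mlm_labels, is_replaced,
                       gen_weight=1.0, disc_weight=50.0):
    """Hypothetical helper: weighted sum of generator and discriminator losses.

    gen_logits:  (batch, seq, vocab) generator predictions
    disc_logits: (batch, seq) discriminator scores, one per token
    mlm_labels:  (batch, seq) original token ids, -100 at unmasked positions
    is_replaced: (batch, seq) 1 where the token was replaced, else 0
    """
    # Generator: masked language modeling loss, computed on masked positions only.
    # cross_entropy expects the class dimension second, hence the transpose.
    gen_loss = F.cross_entropy(gen_logits.transpose(1, 2), mlm_labels,
                               ignore_index=-100)

    # Discriminator: per-token binary classification (replaced vs. original).
    disc_loss = F.binary_cross_entropy_with_logits(disc_logits,
                                                   is_replaced.float())

    # Both models are updated in the same step from one combined loss.
    return gen_weight * gen_loss + disc_weight * disc_loss


if __name__ == "__main__":
    torch.manual_seed(0)
    gen_logits = torch.randn(2, 4, 10, requires_grad=True)
    disc_logits = torch.randn(2, 4, requires_grad=True)
    mlm_labels = torch.full((2, 4), -100)
    mlm_labels[:, 1] = 3                 # pretend position 1 was masked
    is_replaced = torch.zeros(2, 4)
    is_replaced[:, 1] = 1                # pretend the sampled token differed
    loss = electra_joint_loss(gen_logits, disc_logits, mlm_labels, is_replaced)
    loss.backward()
    print(loss.item())
```

Note that in ELECTRA the tokens fed to the discriminator are sampled from the generator, and sampling is not differentiable, so the discriminator loss does not back-propagate into the generator. Even with a single summed loss, the generator still learns only from its own masked language modeling term.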

Thank you :)

hey man,
Thank you very much. I see.
But what if I want to train the generator first, and then use the trained generator to sample inputs for training the discriminator?
Have you done any experiments for that?
Thank you.