EleutherAI/DALLE-mtf

Anneal gumbel softmax temperature during training

sdtblck opened this issue · 0 comments

lucidrains/DALLE-pytorch#10 (comment)

"wow! temperature feature is awesome! Gradually decreasing it from 5 to 0.05 over 5 epochs and convergence is really fast as well as results look much better!!!"