Initial values of mu and sigma

Question

Initial values of mu and sigma

Closed this issue 7 years ago · 2 comments

david-bernstein commented 7 years ago

According to the Eslami at al supplemental materials, it seems that line 45 of run_gqn.py should read

mu, sigma = mu_i, sigma_i

instead of

mu, sigma = mu_f, sigma_f

The subscripts appear to indicate initial and final values of the annealed quantities.

Answer 1 · 2018-09-11T23:40:14.000Z

Does it have any functional implications to change it from what it is currently?

Answer 2 · 2018-09-11T23:45:13.000Z

Not really. For the first epoch the learning rate is the final value (5e-5). After the first epoch it then jumps up to near the initial value and then begins its slow annealing descent. So for the first pass through the data the learning rate is too low. It is cosmetic as the first epoch doesn't really count for anything convergence-wise.