bias annealing weight updates

Question

zacwellmer opened this issue 7 years ago · 1 comments

I could be wrong but it does not seem that you are annealing the bias with important sampling as suggested in the paper(3.4).

w_i = (1/N * 1/P(i))^beta

I think you would have to multiply this w_i term with your gradients

Answer 1 · 2017-10-20T06:53:21.000Z

My apologies, I thought you had included prioritized replay.