Importance Sampling in Generator
cdgiv opened this issue · 2 comments
cdgiv commented
Can you elaborate on this part of the code?
https://github.com/geek-ai/irgan/blob/master/ltr-gan/ltr-gan-pointwise/ltr_gan_d_nn_g_nn.py#L125-L129
I am trying to understand where the importance sampling came from - I do not see it mentioned anywhere in the papers for IRGAN, REINFORCE, GANs, RankNet, etc.
Is this just from Monte Carlo sampling? If so, can you provide a good reference?
Thanks!
SongFGH commented
I also have this question
huangtinglin commented
It's a trick widely used in RL. You can search for the keyword: off-policy, importance sampling. Hope it can help you!