geek-ai/irgan

Importance Sampling in Generator

cdgiv opened this issue · 2 comments

cdgiv commented

Can you elaborate on this part of the code?
https://github.com/geek-ai/irgan/blob/master/ltr-gan/ltr-gan-pointwise/ltr_gan_d_nn_g_nn.py#L125-L129

I am trying to understand where the importance sampling comes from. I do not see it mentioned anywhere in the papers for IRGAN, REINFORCE, GANs, RankNet, etc.

Is this just from Monte Carlo sampling? If so, can you provide a good reference?

Thanks!

I also have this question.

It's a trick widely used in RL. Try searching for the keywords "off-policy" and "importance sampling". Hope that helps!
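
For anyone else landing here, a minimal sketch of the general idea may help. It is not the code from the linked lines, only an illustration of the standard identity E_{x~p}[f(x)] = E_{x~q}[(p(x)/q(x)) f(x)], which holds when q(x) > 0 wherever p(x) f(x) is nonzero. The names `p`, `q`, `f`, and `importance_sampling_estimate` are made up for this example: `p` stands in for the distribution you care about (e.g. the generator's), `q` for the distribution you actually sample from, and `f` for a per-item reward.

```python
import random


def importance_sampling_estimate(p, q, f, n_samples, rng):
    """Estimate E_p[f] from samples drawn under q, reweighted by p/q.

    p, q: lists of probabilities over the same items (hypothetical inputs).
    f:    list of per-item values, e.g. rewards.
    """
    # Draw item indices from the proposal distribution q, not from p.
    indices = rng.choices(range(len(p)), weights=q, k=n_samples)
    total = 0.0
    for i in indices:
        # The importance weight p[i] / q[i] corrects for sampling from q.
        total += (p[i] / q[i]) * f[i]
    return total / n_samples


rng = random.Random(0)           # fixed seed so the run is repeatable
p = [0.7, 0.2, 0.1]              # target distribution (assumed values)
q = [0.4, 0.3, 0.3]              # proposal distribution (assumed values)
f = [1.0, 2.0, 3.0]              # per-item reward (assumed values)

exact = sum(pi * fi for pi, fi in zip(p, f))
estimate = importance_sampling_estimate(p, q, f, 200_000, rng)
print(f"exact={exact:.3f} estimate={estimate:.3f}")
```

In an off-policy policy-gradient setting the same weight p/q typically multiplies each sampled term of the gradient estimate, so samples drawn from a different distribution can still give an unbiased estimate for the target policy.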