xunyiljg opened this issue 5 years ago · 0 comments
Dqn_prioritized is not multiplied by importance sampling weight in training, whether this is a problem.