msinto93/D4PG

ABOUT: target network's Z-atom values

Closed this issue · 0 comments

Thank you for sharing code.
I'm confused about the target network's Z-atom values in line 127 in the file "learner.py".
if you want to get the the target network's Z-atom values, is that not the "output_logits",
if you want to get the "self.z_atoms = tf.lin_space(v_min, v_max, num_atoms)"(utils/network.py line 48) as target_Z_atoms, it will always be same. is that make no sense?