Temperature factor missing in SAC !!!

Question

Opened this issue 2 years ago · 1 comment

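In all of the SAC implementations, log_prob should be multiplied by the temperature factor (alpha) when calculating pi_loss. Without it, the entropy term is effectively weighted with alpha = 1, regardless of the configured or learned temperature.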
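To make the point concrete, here is a minimal sketch of the policy loss with the temperature applied, written in plain Python so it runs without any framework. The function name `sac_pi_loss`, the argument names, and the sample numbers are assumptions for illustration only; in the actual code these would be tensors and the mean would be a tensor reduction.

```python
def sac_pi_loss(log_probs, q_values, alpha):
    """Batch mean of (alpha * log_prob - Q), the usual SAC policy objective.

    log_probs: log pi(a|s) for actions sampled from the current policy
    q_values:  critic estimates Q(s, a) for the same state-action pairs
    alpha:     temperature factor weighting the entropy term
    """
    if len(log_probs) != len(q_values) or not log_probs:
        raise ValueError("log_probs and q_values must be non-empty and equal length")
    terms = [alpha * lp - q for lp, q in zip(log_probs, q_values)]
    return sum(terms) / len(terms)


# Hypothetical batch of two samples.
log_probs = [-1.0, -2.0]
q_values = [0.5, 1.5]

loss_with_alpha = sac_pi_loss(log_probs, q_values, alpha=0.2)
# Dropping alpha is the same as silently fixing it to 1.0.
loss_without_alpha = sac_pi_loss(log_probs, q_values, alpha=1.0)
```

With these numbers the two losses differ (about -1.3 versus -2.5), so omitting alpha changes how strongly the policy is pushed toward high entropy.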

Answer 1 · 2022-03-11T06:31:39.000Z

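Also, the output of the "log_std_head" layer in the Actor network in SAC does not need to go through ReLU, because what we need is the log of the std, not the std itself. The log can legitimately be negative; passing it through ReLU forces log std >= 0, so the std can never drop below 1.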
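A small sketch of the effect, again in plain Python. The helper names and the bounds `LOG_STD_MIN` / `LOG_STD_MAX` are assumptions for illustration, not values taken from the repository; clamping the raw output to a fixed range is a common alternative to ReLU, shown here only for comparison.

```python
import math

# Assumed clamp range for the log std (hypothetical values).
LOG_STD_MIN = -20.0
LOG_STD_MAX = 2.0


def std_with_relu(raw_log_std):
    """Std obtained when the head output is passed through ReLU first."""
    return math.exp(max(raw_log_std, 0.0))


def std_with_clamp(raw_log_std):
    """Std obtained when the head output is clamped to a range instead."""
    clamped = min(max(raw_log_std, LOG_STD_MIN), LOG_STD_MAX)
    return math.exp(clamped)


raw = -1.0  # the network asks for a narrow distribution
relu_std = std_with_relu(raw)    # ReLU maps -1.0 to 0.0, so std is exp(0) = 1
clamp_std = std_with_clamp(raw)  # -1.0 is inside the range, so std is exp(-1)
```

With ReLU the policy cannot become more concentrated than std = 1, whereas the clamped version keeps the negative log std and yields a std below 1.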