seungeunrho/minimalRL

Wrong formula for calc-target in SAC?


See https://github.com/seungeunrho/minimalRL/blob/c8efed8481e3cd40e9739cfde220a55522555b57/sac.py#L127C1-L127C54

Shouldn't the formula be target = r + gamma * (1 - done) * (min_q + entropy), so that the bootstrapped term is dropped when the episode has ended?
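To make the question concrete, here is a minimal sketch of the two ways this target is commonly written. It is not taken from sac.py: the function names, the default gamma value, and the sample numbers are hypothetical, and `entropy` is assumed to be the already-computed entropy bonus for the next action. The second function assumes the replay buffer stores a continuation mask (1.0 - done) instead of the raw done flag; whether the linked line relies on that convention is not confirmed here.

```python
def soft_td_target(r, done, min_q, entropy, gamma=0.98):
    """Soft Bellman target where `done` is 1.0 on a terminal step, else 0.0."""
    # (1 - done) removes the bootstrapped value on terminal transitions.
    return r + gamma * (1.0 - done) * (min_q + entropy)


def soft_td_target_masked(r, done_mask, min_q, entropy, gamma=0.98):
    """Same target, assuming the buffer stores done_mask = 1.0 - done."""
    # Here the mask is already inverted, so it multiplies the bootstrap directly.
    return r + gamma * done_mask * (min_q + entropy)


# Hypothetical numbers, chosen only for illustration.
terminal = soft_td_target(r=1.0, done=1.0, min_q=5.0, entropy=0.5)
non_terminal = soft_td_target(r=1.0, done=0.0, min_q=5.0, entropy=0.5)

print(terminal)      # only the reward remains on a terminal step
print(non_terminal)  # reward plus the discounted soft value
```

If the variable named done in the linked code is really a mask of this second kind, the two expressions give the same number and only the naming is misleading; if it is the raw done flag, the (1 - done) form is the one that zeroes the bootstrap at episode end.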