Remove T_MAX constraint in reward computation
Closed this issue · 0 comments
Cernewein commented
Even though this has no real impact on learning, it is incorrect.