Remove T_MAX constraint in reward computation

Question

Closed this issue 5 years ago · 0 comments

Even though the T_MAX constraint in the reward computation has no real impact on learning, it is incorrect.