Cernewein/heating-RL-agent

Remove T_MAX constraint in reward computation

Closed · 0 comments

Even though the T_MAX constraint has no real impact on learning, it is incorrect and should be removed from the reward computation.