Issues
Calculating returns with signed rewards
#7 opened by backpropper - 1
SIL Value update
#3 opened by boscotsang - 0
entropy in SIL policy loss
#6 opened by gabrieledcjr - 1
Policy 'lstm' doesn't work
#5 opened by HaozhengLi - 0
Key-Door-Treasure
#4 opened by anagorko - 2
np.sign(rewards)
#2 opened by bhairavmehta95