Typo in the Implementation
Akella17 opened this issue · 1 comment
Akella17 commented
Line 121 in d2e587a
Current target: r_t + \gamma * mask + v_{t+1}
Correct target: r_t + \gamma * mask * v_{t+1}
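To illustrate why the operator matters, here is a minimal sketch in plain Python. It is not the repository's code: the function name `td_targets`, the list-based inputs, and the sample numbers are all hypothetical, and it assumes `mask` is 0 at a terminal step and 1 otherwise.

```python
def td_targets(rewards, masks, next_values, gamma=0.99):
    """Compute r_t + gamma * mask_t * v_{t+1} for every step.

    Assumption: mask_t is 0.0 when step t ends the episode and 1.0
    otherwise, so the bootstrapped value is dropped at terminal steps.
    """
    return [r + gamma * m * v for r, m, v in zip(rewards, masks, next_values)]


def td_targets_with_typo(rewards, masks, next_values, gamma=0.99):
    """The reported form, r_t + gamma * mask_t + v_{t+1}, for comparison."""
    return [r + gamma * m + v for r, m, v in zip(rewards, masks, next_values)]


# Hypothetical three-step episode; the last step is terminal (mask = 0.0).
rewards = [1.0, 1.0, 1.0]
masks = [1.0, 1.0, 0.0]
next_values = [0.5, 0.5, 0.5]

targets = td_targets(rewards, masks, next_values, gamma=0.9)
typo_targets = td_targets_with_typo(rewards, masks, next_values, gamma=0.9)
```

With the multiplication, the terminal step's target reduces to the reward alone. With the addition, v_{t+1} is added undiscounted at every step, including the terminal one, and \gamma is added as a constant whenever the mask is 1.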
ikostrikov commented
Fixed in d900aa6