iamhectorotero/rlai-exercises

Error in solution to exercise 3.9

sachingodishela opened this issue · 1 comments

Reward Sequence: 2, 7, 7, 7, 7,....
G0 = R1 + (0.9) * G1
G1 = R2 + (0.9) * G1

=> G0 = 65
=> G1 = 70

Thanks for reporting @sachingodishela!