A possible typo?
XBLDev opened this issue · 1 comment
XBLDev commented
In https://github.com/awjuliani/DeepRL-Agents/blob/master/Vanilla-Policy.ipynb, there's a comment:
#Get our reward for taking an action given a bandit.
That tutorial uses the CartPole-v0 environment, right? I don't think there is a bandit in that problem :-)
araknadash commented
It should be CartPole-v1