awjuliani/DeepRL-Agents

A possible typo?

XBLDev opened this issue · 1 comments

In https://github.com/awjuliani/DeepRL-Agents/blob/master/Vanilla-Policy.ipynb, there's a comment:

#Get our reward for taking an action given a bandit.

That tutorial uses the CartPole-v0 environment right? I don't think there is a bandit in that problem :-)

It should be CartPole-v1