Reinforce With Baseline in PyTorch

An implementation of Reinforce Algorithm with a parameterized baseline, with a detailed comparison against whitening. ##Performance of Reinforce trained on CartPole alt text

##Average Performance of Reinforce for multiple runs alt text

##Comparison of subtracting a learned baseline from the return vs. using return whitening alt text