An implementation of Reinforce Algorithm with a parameterized baseline, with a detailed comparison against whitening.
##Performance of Reinforce trained on CartPole

##Average Performance of Reinforce for multiple runs

##Comparison of subtracting a learned baseline from the return vs. using return whitening
