An implementation of VAPS(1), the simplified Value and Policy Search algorithm, originally presented in:
Leonid Peshkin, Nicolas Meuleau, and Leslie Kaelbling. "Learning Policies with External Memory." International Conference on Machine Learning (ICML), 2001. [Paper]
See the short write-up and detailed presentation for more information.