haz opened this issue 3 years ago · 0 comments
Some techniques require the full state space in order to work. We can capture this as the complete set of transitions -- each $(s,a,s')$ transition being a trace of length 2.