tspooner/rsrl

Implicit compatible function approximation

Opened this issue · 0 comments

Find a neat solution to using the policy score function as the features of an LFA instance. The issue at the moment is that the project method only takes a single input. The score function variant would also require the action. There are loads of ways to do this, but we want something that won't require rethinking later down the line.