Implicit compatible function approximation
Opened this issue · 0 comments
tspooner commented
Find a neat solution to using the policy score function as the features of an LFA instance. The issue at the moment is that the project
method only takes a single input. The score function variant would also require the action. There are loads of ways to do this, but we want something that won't require rethinking later down the line.