Epsilon Exploitable Opponent

Question

Jamesflynn1 opened this issue 2 years ago · 1 comments

For EV experiments, might require research.

Answer 1 · 2023-03-22T18:06:52.000Z

MCCFR produces an Epsilon Exploitable Opponent.

Run for differing number of iterations or vary parameters for different grades of opponents.

Use OpenSpiel MCCFR, requires conversion between AveragePolicy and TabularPolicy object.

Requires a wrapper to run and store the policy and configure parameters (if any). Get this working asap.