Epsilon Exploitable Opponent
Jamesflynn1 opened this issue · 1 comment
Jamesflynn1 commented
Needed for the EV experiments; might require some research first.
Jamesflynn1 commented
MCCFR produces an epsilon-exploitable opponent.
Run it for different numbers of iterations, or vary its parameters, to get opponents of different grades.
Use OpenSpiel's MCCFR; this requires converting the resulting AveragePolicy object into a TabularPolicy object.
It also needs a wrapper that runs the solver, stores the policy, and configures the parameters (if any). Get this working ASAP.
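
A minimal sketch of what such a wrapper could look like, not a finished implementation. The names `OpponentTrainer`, `train_grades`, and `openspiel_trainer` are hypothetical. The OpenSpiel calls (`ExternalSamplingSolver`, `AveragePolicy.action_probabilities`, `tabular_policy_from_callable`) are used as I understand the Python API and should be checked against the installed version. External sampling is an assumption; outcome sampling may fit better.

```python
class OpponentTrainer:
    """Hypothetical wrapper: runs an MCCFR-style solver and stores one policy per grade."""

    def __init__(self, make_solver, to_tabular=lambda avg: avg):
        # make_solver: zero-argument callable returning an object that exposes
        #   .iteration() and .average_policy() (the shape of OpenSpiel's MCCFR solvers).
        # to_tabular: converts the averaged policy into the format we want to keep.
        self.make_solver = make_solver
        self.to_tabular = to_tabular
        self.policies = {}  # iteration count -> stored policy

    def train(self, iterations):
        """Train a fresh solver for `iterations` steps and store its average policy."""
        solver = self.make_solver()
        for _ in range(iterations):
            solver.iteration()
        self.policies[iterations] = self.to_tabular(solver.average_policy())
        return self.policies[iterations]

    def train_grades(self, iteration_counts):
        """Fewer iterations should give a weaker (more exploitable) opponent."""
        return {n: self.train(n) for n in iteration_counts}


def openspiel_trainer(game_name="kuhn_poker"):
    """Assumed OpenSpiel wiring; imports are lazy so the wrapper has no hard dependency."""
    import pyspiel
    from open_spiel.python import policy as policy_lib
    from open_spiel.python.algorithms import external_sampling_mccfr

    game = pyspiel.load_game(game_name)
    return OpponentTrainer(
        make_solver=lambda: external_sampling_mccfr.ExternalSamplingSolver(game),
        # AveragePolicy -> TabularPolicy conversion via the policy's callable form.
        to_tabular=lambda avg: policy_lib.tabular_policy_from_callable(
            game, avg.action_probabilities),
    )
```

With OpenSpiel installed, `openspiel_trainer("kuhn_poker").train_grades([100, 10_000])` should return two TabularPolicy opponents of different strength. Their epsilon could then be measured with `open_spiel.python.algorithms.exploitability.exploitability(game, policy)`.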