get_state_value add two more options to calculate state_value

Question

get_state_value add two more options to calculate state_value

Closed this issue 2 years ago · 2 comments

From jbg
rollout with "random moves" = sampling from the weighted ranked list of DipNet suggested moves

From Jonathan Kummefeld
one alternative value function is some combination of:
(a) Change in SCs
(b) How close the proposed orders are to what we would do anyway (closer is better)
(c)How many cross-power supports happen if you make the moves
Maybe 10(a) + (b) + 2(c) ? That means SCs dominate, and the others are tie-breakers, with move changes that lead to supports being worth changing for

Answer 1 · 2022-12-06T22:58:41.000Z

This can be closed if it's completed. @wwongkamjan

Answer 2 · 2022-12-07T00:56:27.000Z

yes it's done.