get_state_value: add two more options for calculating state_value
Closed this issue · 2 comments
wwongkamjan commented
From jbg
Rollout with "random moves" = sampling each move from the weighted, ranked list of moves that DipNet suggests
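A minimal sketch of that sampling step, for illustration only. The input shape is an assumption: `ranked_moves` is a hypothetical dict mapping each unit to a list of `(order, weight)` pairs, standing in for whatever DipNet actually returns. `sample_orders` is likewise a made-up name, not a function in this repo.

```python
import random


def sample_orders(ranked_moves, rng=None):
    """Pick one order per unit, with probability proportional to its weight.

    ranked_moves: hypothetical structure {unit: [(order, weight), ...]};
    weights must be non-negative and not all zero for a given unit.
    rng: optional random.Random instance for reproducible rollouts.
    """
    rng = rng or random
    orders = {}
    for unit, candidates in ranked_moves.items():
        moves = [move for move, _ in candidates]
        weights = [weight for _, weight in candidates]
        # random.choices draws with replacement according to the weights.
        orders[unit] = rng.choices(moves, weights=weights, k=1)[0]
    return orders
```

Passing a seeded `random.Random` makes a rollout repeatable, which may help when comparing state values across runs.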
From Jonathan Kummefeld
One alternative value function is some combination of:
(a) Change in SCs
(b) How close the proposed orders are to what we would do anyway (closer is better)
(c) How many cross-power supports happen if you make the moves
Maybe 10(a) + (b) + 2(c)? That means SCs dominate and the other two are tie-breakers, with a change of orders being worth making when it leads to a support.
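The weighting above could be sketched roughly as follows. This is an assumption-heavy illustration, not the code that was merged: `order_closeness` and `combined_value` are hypothetical names, and measuring (b) as the fraction of proposed orders that match our default orders is just one possible reading of "how close".

```python
def order_closeness(proposed, default):
    """(b), as assumed here: fraction of proposed orders we would issue anyway.

    Returns a value in [0.0, 1.0]. An empty proposal is treated as no
    deviation from the default (1.0), which is an arbitrary choice.
    """
    if not proposed:
        return 1.0
    default_set = set(default)
    matches = sum(1 for order in proposed if order in default_set)
    return matches / len(proposed)


def combined_value(sc_change, closeness, cross_power_supports,
                   w_sc=10.0, w_close=1.0, w_support=2.0):
    """Weighted sum 10(a) + (b) + 2(c) with the weights suggested above.

    sc_change: (a) change in supply centers (SCs after minus SCs before).
    closeness: (b) similarity of proposed orders to our default orders.
    cross_power_supports: (c) number of supports between different powers.
    """
    return (w_sc * sc_change
            + w_close * closeness
            + w_support * cross_power_supports)
```

For example, gaining one SC with half of the orders unchanged and two cross-power supports gives `combined_value(1, 0.5, 2)`, i.e. 10 + 0.5 + 4. With (b) bounded by 1, a single extra support outweighs any loss in closeness, which matches the intent that supports are worth changing orders for.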
kartik2112 commented
This can be closed if it's completed. @wwongkamjan
wwongkamjan commented
yes it's done.