StepNeverStop/RLs

Implement Model Saving Mechanism

Closed this issue 3 years ago · 0 comments

StepNeverStop commented 4 years ago

include save model based on:

training time cost
score performance of under-training policy
training timestep
...