mzjb/DeepH-pack

How to restart training model

fangrh opened this issue · 2 comments

Hi,

When I run the example of using ABACUS, everything gone well except the training is canceled by the slurm system due to the time limit.

Because the maximum time of using the GPU accelerate card is limited for one submit in our cluster. I wander if I can restart the training when next submit ?

OK, it works, thank you very much.