How to restart training model
fangrh opened this issue · 2 comments
fangrh commented
Hi,
When I run the example of using ABACUS, everything gone well except the training is canceled by the slurm system due to the time limit.
Because the maximum time of using the GPU accelerate card is limited for one submit in our cluster. I wander if I can restart the training when next submit ?
mzjb commented
fangrh commented
OK, it works, thank you very much.