Saving and closing in the middle of training
theaimeltus opened this issue · 0 comments
theaimeltus commented
So, I'm currently training on a dataset but since I have only 1 GPU, the process takes very long and it can only save a checkpoint automatically after 1000 steps... so is there a way to save and close checkpoint manually at any intervals?