No further output after ``Start from epoch 0``
BatchClayderman opened this issue · 3 comments
BatchClayderman commented
Hi, guys.
When I run `python main.py` on Windows, it works and exits normally.
I then run `python main.py --cfg configs/Mamba/peptides-func-EX.yaml wandb.use False`, also on Windows.
After it prints `Num parameters: 373018` and `Start from epoch 0`, there is no further output for 3 hours.
After debugging, I found that it is executing `train_dict[cfg.train.mode](loggers, loaders, model, optimizer, scheduler)`.
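One way to see what that call is doing while it appears stuck is Python's standard `faulthandler` module, which can periodically dump the traceback of every thread. The following is only a minimal sketch: `run_with_watchdog` and its `interval_s` and `file` parameters are hypothetical names introduced here for illustration, not part of the repository.

```python
import faulthandler
import sys


def run_with_watchdog(fn, *args, interval_s=60.0, file=None, **kwargs):
    """Call fn(*args, **kwargs) and dump all thread tracebacks
    every `interval_s` seconds for as long as the call is running.

    `run_with_watchdog` is a hypothetical helper, not a repository function.
    """
    # faulthandler needs a real file (one with a file descriptor); default to stderr.
    target = sys.stderr if file is None else file
    faulthandler.dump_traceback_later(interval_s, repeat=True, file=target)
    try:
        return fn(*args, **kwargs)
    finally:
        # Stop the watchdog so no further dumps appear after fn returns or raises.
        faulthandler.cancel_dump_traceback_later()
```

It could be used as `run_with_watchdog(train_dict[cfg.train.mode], loggers, loaders, model, optimizer, scheduler)`. If successive dumps keep showing the same frame, the run is probably hung; if the frames keep changing, it is more likely just slow.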
I am confused by this behavior. I have four guesses:
- The training itself produces no output, so nothing more will appear on the console until it ends.
- The training does produce output, but one epoch takes so long that it looks like it is stuck.
- The training is running on the CPU.
- The training is not running correctly.
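To tell the first two guesses apart, one option is to wrap whatever iterable the training loop consumes in a small progress reporter. This is a sketch under assumptions: `with_heartbeat` and its `every`, `label`, `clock`, and `emit` parameters are hypothetical names, and where the wrapper would be applied depends on how the repository's training loop is written, which the issue does not show.

```python
import time


def _print_flush(msg):
    # Flush immediately so the line shows up even if stdout is buffered.
    print(msg, flush=True)


def with_heartbeat(iterable, every=10, label="batch",
                   clock=time.perf_counter, emit=_print_flush):
    """Yield items unchanged and report the count of completed items and
    the elapsed time after every `every` items.

    Hypothetical helper for illustration; not part of the repository.
    """
    start = clock()
    for i, item in enumerate(iterable, start=1):
        yield item
        # Execution resumes here once the consumer has finished with `item`.
        if i % every == 0:
            emit(f"{label} {i}: {clock() - start:.1f}s elapsed")
```

If heartbeat lines appear, the run is progressing and an epoch is simply slow; if none appear, the loop is not advancing. For the third guess, and assuming the project is built on PyTorch, `torch.cuda.is_available()` reports whether a CUDA GPU is visible to the process.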
However, I cannot tell which of these is happening or how to fix it. Do you have any ideas?
Thank you very much.