bowang-lab/Graph-Mamba

No further output after ``Start from epoch 0``

BatchClayderman opened this issue · 3 comments

Hi, guys.
When I launch the Python via python main.py on my Windows, it works well and exits normally.
Subsequently, I launch the Python via python main.py --cfg configs/Mamba/peptides-func-EX.yaml wandb.use False on my Windows.
After it shows Num parameters: 373018 and Start from epoch 0, there is no more output in 3 hours.
After debugging, I know that it is handling train_dict[cfg.train.mode](loggers, loaders, model, optimizer, scheduler).
I am confusing about this phenomenon. There are four guesses from my perspective.

  1. The training itself has no output. Thus, nothing more will show on the console until it ends.
  2. The training itself has output. However, it takes a lot of time to train one epoch. Thus, it looks like that it is stuck.
  3. The training is using CPU.
  4. The training is abnormal.
    However, I can hardly know what happens and how to handle this issue. Do you have any ideas?
    Thank you very much.