Requeuing not triggered on node startup fail
Opened this issue · 1 comments
XaverStiensmeier commented
Currently, jobs that are scheduled to nodes that fail during startup might not be requeued.
XaverStiensmeier commented
We have opened a ticket, but will also try to fix it by upgrading the Slurm version