argonne-lcf/user-guides

Aurora doc suggestion for known issues page


A suggestion, not a bug, for your consideration:

On https://github.com/argonne-lcf/user-guides/blob/main/docs/aurora/known-issues.md you might consider appending "| grep comment" to the qstat command at the end of the following passage:

"Jobs may fail to successfully start at times (particularly at higher node counts). If no error message is apparent, then one thing to check is the comment field in the full job information for the job using the command qstat -xf [JOBID]".

This trims the output down to just the comment line:
childers@aurora-uan-0010:~/test/xalt> qstat -fx 614296 | grep comment
comment = Not Running: User has reached queue LustreApps running job limit.
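
If it helps, the same filter could be wrapped in a small shell function. This is only a sketch: `comment_field` is a hypothetical name, not something provided on Aurora, and the sample input below is an abridged, made-up stand-in for real `qstat -fx` output (only the comment line is taken from the session above).

```shell
# Hypothetical helper: keep only the "comment" line from `qstat -fx`
# output arriving on stdin, and strip its leading indentation.
comment_field() {
    grep 'comment = ' | sed 's/^[[:space:]]*//'
}

# On a login node this would be fed by the real command, e.g.:
#   qstat -fx 614296 | comment_field
# Here a few illustrative lines stand in for the qstat output so the
# snippet can be tried anywhere; real output has many more fields.
printf '%s\n' \
  'Job Id: 614296' \
  '    job_state = Q' \
  '    comment = Not Running: User has reached queue LustreApps running job limit.' \
  | comment_field
```

Matching on "comment = " rather than just "comment" is slightly stricter, in case another field happens to contain the word.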