/runner

job runner

Primary LanguageJuliaMIT LicenseMIT

job runner

gpu job schedule via file system.

  1. do some cleaning work
    1. if there is a free gpu, and a running job file marked that gpu, then remark the file as [{$GPU}UNKNOWN{$TIME}]
  2. when there is a free gpu
  3. pick a job file (shell script) in jobs/queue/
  4. move the job under jobs/ and mark as [{$GPU}RUN{$TIME}]
    1. do a backup [{$GPU}BK{$TIME}]
  5. start that job and redirect output to job's log file
  6. if fail, or exception raised, mark the job as [{$GPU}ERROR{$TIME}]
  7. if success, mark as [DONE{$TIME}], and move job file and log file to jobs/done/