Issues
Failure of submitit 1.5.1 with python 3.12 because of missing pkg_resources
#1765 opened by GianlucaFicarelli - 2
UnicodeDecodeError fails the job
#1712 opened by phtu-cs - 0
Can I use torchrun with submitit?
#1764 opened by vasudev-sharma - 2
Turn off Signal Handling
#1760 opened by lukasbm - 2
Using `RsyncSnapshot` with an editable package install
#1763 opened by jc-audet - 2
Be tolerant of sacct errors?
#1720 opened by min-xu-ai - 0
Unexpected behavior of memory specification between `AutoExecutor` and `SlurmExecutor`
#1761 opened by mshvartsman - 0
Too many sacct requests for batched tasks
#1759 opened by Fadelis98 - 0
Failed to launch: Invalid wckey specification
#1758 opened by rskwesterman - 0
Improving performance with NVIDIA GPU affinity?
#1756 opened by giorgos117 - 2
Consider supporting slurm rest api
#1719 opened by zeronewb - 2
No user code logging output is shown in logs
#1721 opened by fleimgruber - 1
SLURM Job keeps running after Successful Job Completion (Hydra Submitit Plugin)
#1731 opened by subho406 - 2
Enabling sbatch file re-use.
#1739 opened by alexnwang - 1
timeout_min=0 results in pending jobs when a Slurm partition timelimit is set
#1742 opened by ddangu525 - 2
Support Slurm Heterogeneous Job
#1741 opened by sunshine-syz - 4
Submit Over SSH?
#1711 opened by JRJacoby - 1
Conda version out of date
#1737 opened by Ubadub - 1
array_parallelism on local machine
#1736 opened by sparisi - 2
Submitit with SLURM sub-scheduling
#1734 opened by giorgos117 - 3
duplicate tasks when using `SlurmExecutor.map_array`
#1727 opened by eringrant - 0
SLURM Jobs keep running after successful job completion.
#1730 opened by subho406 - 4
Add custom options to sbatch command in SLURM
#1728 opened by nilskober - 6
Submitit with sbatch
#1726 opened by pfrwilson - 1
Unwanted behavior after a slurm job time limit
#1708 opened by ofir1080 - 1
Printing in Signal Handlers May Be Unsafe
#1714 opened by Queuecumber - 5
How to specify GPUs when executing locally?
#1696 opened by j0ma - 3
Submitit puts all tasks on a single GPU
#1704 opened by Bai-YT - 1
Should we submit job on login node?
#1722 opened by surajmenon72 - 1
array_parallelism for LocalExecutor
#1718 opened by se-ok - 1
submitit.core.utils.FailedJobError: sbatch: error: Parameter --gres=gpu:1 no longer acceptable, please switch to --gpus=1
#1710 opened by RoyAmoyal - 1
Can submitit manage chain dependencies?
#1723 opened by eserie - 1
Compute Canada
#1724 opened by kaijieshi7 - 2
NodeList Declaration
#1692 opened by Bontempogianpaolo1 - 3
Remove `#SBATCH --nodes=1`
#1725 opened by sgbaird - 1
How to load the original code point when preempted and rescheduled if the code is changed before rescheduling?
#1717 opened by dahyun-kang - 4
Switching from USR1 Breaks Pytorch Lightning
#1709 opened by Queuecumber - 1
Recover jobs after kernel dies
#1707 opened by SamuelGabriel - 0
submit job array to multiple partitions
#1698 opened by MinkyuHa - 2
Latest versions' tags not on GitHub
#1690 opened by tmct - 1
Task does not wait for GPU memory resources
#1689 opened by chirico85 - 1
filedescriptor out of range in select()
#1687 opened by timlacroix