slurm
There are 662 repositories under slurm topic.
stas00/ml-engineering
Machine Learning Engineering Open Book
nextflow-io/nextflow
A DSL for data-driven computational pipelines
SchedMD/slurm
Slurm: A Highly Scalable Workload Manager
facebookincubator/submitit
Python 3.8+ toolbox for submitting jobs to Slurm
DataBiosphere/toil
A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
PySlurm/pyslurm
Python Interface to Slurm
rackslab/Slurm-web
Open source web interface for Slurm HPC clusters
elasticluster/elasticluster
Create clusters of VMs on the cloud and configure them with Ansible.
pytorch/torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
giovtorres/slurm-docker-cluster
A Slurm cluster using docker-compose
LambdaLabsML/distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
pipefunc/pipefunc
Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪
justanhduc/task-spooler
A scheduler for GPU/CPU tasks
Azure/batch-shipyard
Simplify HPC and Batch workloads on Azure
zhenrong-wang/hpc-now
A Cross-Platform, Multi-Cloud High-Performance Computing Platform
vpenso/prometheus-slurm-exporter
Prometheus exporter for performance metrics from Slurm.
dell/omnia
An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
mllg/batchtools
Tools for computation on batch systems
TUM-DAML/seml
SEML: Slurm Experiment Management Library
mschubert/clustermq
R package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
jdblischak/smk-simple-slurm
A simple Snakemake profile for Slurm without --cluster-config
nebius/soperator
Run Slurm in Kubernetes
gdikov/hypertunity
A toolset for black-box hyperparameter optimisation.
kabouzeid/turm
TUI for the Slurm Workload Manager
SciDAS/slurm-in-docker
Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images
ohsu-comp-bio/funnel
Funnel is a toolkit for distributed task execution via a simple, standard API.
NREL/HPC
A collection of various resources, examples, and executables for the general NREL HPC user community's benefit. Use the following website for accessing documentation.
sylabs/wlm-operator
Singularity implementation of k8s operator for interacting with SLURM.
neilmunday/slurm-mail
Slurm-Mail is a drop in replacement for Slurm's e-mails to give users much more information about their jobs compared to the standard Slurm e-mails.
giovtorres/docker-centos7-slurm
Slurm Docker Container on CentOS 7
CLAIRE-Labo/python-ml-research-template
A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust
futureverse/future.batchtools
:rocket: R package future.batchtools: A Future API for Parallel and Distributed Processing using batchtools
mil-ad/stui
A Slurm dashboard for the terminal.
JacopoPan/a-minimalist-guide
Walkthroughs for DSL, AirSim, the Vector Institute, and more
NERSC/slurm-magic
IPython magic for SLURM.
aws-samples/aws-hpc-recipes
Contains example recipes that demonstrate how to build HPC systems using AWS services and solutions.