/slurmd-operator

Operator charm for Slurmd.

Primary LanguagePythonApache License 2.0Apache-2.0

slurmd operator

A Juju operator for slurmd - the compute node daemon of SLURM.

Charmhub Badge CI Release Matrix

Features

The slurmd operator provides and manages the slurmd daemon. This operator provides the compute node service for machines enlisted as compute nodes in Charmed SLURM clusters.

Usage

This operator should be used with Juju 3.x or greater.

Deploy a minimal Charmed SLURM cluster

$ juju deploy slurmctld --channel edge
$ juju deploy slurmd --channel edge
$ juju integrate slurmctld:slurmd slurmd:slurmctld

Operations

This charm hardens and simplifies operations by codifying common administration operations as charm actions.

Partition Configuration

Specify partition parameters using the charm configuration, partition-config.

Use the partition-config to set custom partition parameters.
$ juju config slurmd partition-config="State=INACTIVE"

Node Configuration Parameters

You can get and set the node configuration using the node-config action.

Use the node-config action to get the node configuration for the unit.
$ juju run --quiet slurmd/0 node-config --format json | jq ".[].results.node.config"
"NodeName=juju-462521-4 NodeAddr=10.240.222.28 State=UNKNOWN RealMemory=64012 CPUs=12 ThreadsPerCore=2 CoresPerSocket=6 SocketsPerBoard=1"
Use the node-config action to set a custom weight value for the node.
$ juju run --quiet slurmd/0 node-config parameters="Weight=5000" --format json | jq ".[].results.node.config"
"NodeName=juju-462521-4 NodeAddr=10.240.222.28 State=UNKNOWN RealMemory=64012 CPUs=12 ThreadsPerCore=2 CoresPerSocket=6 SocketsPerBoard=1 Weight=5000"

Project & Community

The slurmd operator is a project of the Ubuntu HPC community. It is an open source project that is welcome to community involvement, contributions, suggestions, fixes, and constructive feedback. Interested in being involved with the development of the slurmd operator? Check out these links below:

License

The slurmd operator is free software, distributed under the Apache Software License, version 2.0. See the LICENSE file for more information.