Slurm Workload Manager is a batch scheduling software used for requesting resources and running jobs on the CoE HPC cluster. Once your HPC account has been enabled, you will be assigned to a Slurm account corresponding to your department or class. Assignment to a research group account is required for accessing private resources, and assignment to a class account is required for accessing resources reserved for class use.
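If you are unsure which Slurm accounts you have been assigned to, you can list your account associations with the standard sacctmgr command (a quick check; the columns shown depend on the cluster's Slurm configuration):
sacctmgr show associations user=$USER format=User,Account,Partition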
The Slurm software module should be loaded automatically when you log into a submit node, which you can confirm by typing:
module list
If Slurm is not listed, you may need to log in to TEACH and click "Reset Unix Config Files". In the meantime, you can load it manually by typing:
module load slurm
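As a one-off check in a bash shell, you can load the module only when it is missing (a minimal sketch; "module list" writes its output to standard error):
module list 2>&1 | grep -qi slurm || module load slurm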
Once the Slurm module is loaded, you can begin using Slurm to request resources and submit jobs. See the sections below for examples using Slurm.
If you want to request resources for an interactive session or single command, you can use the Slurm command "srun":
srun {srun options} {command or shell}
See examples of srun below:
To start an interactive (pseudo-terminal bash shell) session on the cluster:
srun --pty bash
tcsh and other shells can be used instead of bash with srun (and sbatch) if preferred.
To request an interactive tcsh session with X11 forwarding (e.g. for running a GUI) from the default (share) queue for more than the default 12 hours, e.g. 3 days:
srun --time=3-00:00:00 --x11 --pty tcsh
To request an interactive bash session with 8 OpenMP threads/cores and 50GB RAM from the preempt queue:
srun -p preempt -c 8 --mem=50G --pty bash
To request an interactive bash session to run an application using 12 OpenMP threads and 4GB per thread/cpu from the share queue:
srun --cpus-per-task=12 --mem-per-cpu=4G --pty bash
To request an interactive zsh session with a GPU from the dgxh queue:
srun -p dgxh --gres=gpu:1 --pty zsh
To request an interactive session with a GPU from any of the dgx2, gpu, or eecs queues using the eecs account:
srun -A eecs -p dgx2,gpu,eecs --gres=gpu:1 --pty bash
To request an interactive session with two GPUs on a specific node in the dgx2 queue using the eecs account:
srun -p dgx2 --nodelist=dgx2-1 --gres=gpu:2 --pty bash
To request an interactive bash session from the class queue using the cs499 account:
srun -A cs499 -p class --pty bash
To request an interactive session to run an MPI application with 8 processes over an arbitrary number of nodes from the preempt queue:
srun --partition=preempt -n 8 --pty bash
To request an interactive session to run an MPI application on 2 nodes with 8 processes per node from the share queue:
srun -N 2 --ntasks-per-node=8 --pty bash
To request an interactive tcsh session on a compute node running the "el8" operating system, from the preempt queue:
srun -p preempt --constraint=el8 --pty tcsh
To request an interactive session with one GPU on a compute node with an A40 GPU, from the preempt queue:
srun -p preempt --gres=gpu:1 --constraint=a40 --pty bash
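To run a single command or script directly instead of an interactive shell (here a hypothetical program ./myProgram, using 4 cores and 8GB RAM from the default share queue); srun blocks until the command finishes and prints its output to your terminal:
srun -c 4 --mem=8G ./myProgram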
For more information or options on srun, check out the manual page by typing "man srun".
Submitting batch jobs is useful when you need to run several jobs or long-running jobs. Batch job submissions will not be interrupted by lost network or VPN connections. The Slurm command "sbatch" is used to submit batch jobs.
To submit a batch job to the cluster:
sbatch myBatchFile.sh
where myBatchFile.sh is a script containing the commands you wish to run as well as a list of SBATCH directives specifying the resources or parameters that you need for your job. See the sample myBatchFile.sh batch script below.
#!/bin/bash
#SBATCH -J helloWorld # name of job
#SBATCH -A mySponsoredAccount # name of my sponsored account, e.g. class or research group, NOT ONID!
#SBATCH -p share # name of partition or queue
#SBATCH -o helloWorld.out # name of output file for this submission script
#SBATCH -e helloWorld.err # name of error file for this submission script
# load any software environment module required for app (e.g. matlab, gcc, cuda)
module load software/version
# run my job (e.g. matlab, python)
mySoftwareExecutable
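Note that options given on the sbatch command line take precedence over the corresponding #SBATCH directives in the script, so the same script can be reused with different resources, for example (illustrative values):
sbatch -p dgxh --time=1-00:00:00 myBatchFile.sh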
There are additional sample batch scripts located in /apps/samples, so feel free to copy them to your directory and modify them according to your needs.
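For example, to see what is available and copy one of the samples into your home directory for editing (the actual file names under /apps/samples will vary):
ls /apps/samples
cp /apps/samples/{sampleScript} ~/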
Here are some other useful SBATCH directives that can be included in a batch script to specify additional resources or different parameters from the defaults:
#SBATCH -p dgx2 # partition (default share)
#SBATCH --partition=dgxh # partition (default share)
#SBATCH -A mySpecialAccount # account name to access special or private partition
#SBATCH --account=mySpecialAcct # account name to access special or private partition
#SBATCH -t 24:30:00 # time limit on job: 24 hours, 30 minutes (default 12 hours)
#SBATCH --time=2-12:30:00 # time limit on job: 2 days, 12 hours, 30 minutes (default 12 hours)
#SBATCH --gres=gpu:1 # number of GPUs to request per node (default 0)
#SBATCH --gpus-per-node=1 # number of GPUs per node (default 0)
#SBATCH -G 1 # number of GPUs per job (default 0)
#SBATCH --gpus=1 # number of GPUs per job (default 0)
#SBATCH --mem=10G # request 10 gigabytes memory per node (default depends on node)
#SBATCH --mem-per-cpu=10G # request 10 gigabytes per core/cpu
#SBATCH -c 4 # number of cores/threads per task (default 1)
#SBATCH --cpus-per-task=2 # number of cores/threads per task (default 1)
#SBATCH -n 3 # number of MPI tasks per job (default 1)
#SBATCH --ntasks=3 # number of MPI tasks per job (default 1)
#SBATCH --ntasks-per-node=3 # number of MPI tasks per node (default 1), use for multiple nodes (-N>1)
#SBATCH -N 2 # number of nodes per job (default 1)
#SBATCH --nodes=2 # number of nodes per job (default 1)
#SBATCH --nodelist=node1,node2... # required node, or comma-separated list of required nodes
#SBATCH --exclude=node1,node2... # excluded node, or comma-separated list of excluded nodes
#SBATCH --constraint=el9 # request node running el9 operating system
#SBATCH --constraint=avx512 # request node with AVX512 instruction set
#SBATCH --constraint=a40 # request node with A40 GPU
#SBATCH --mail-type=FAIL # send email notification if the job fails
#SBATCH --mail-type=BEGIN,END # send email notification when the job starts and completes
#SBATCH --mail-type=ALL # send email notification when job starts, ends, fails, or requeues
#SBATCH --mail-type=TIME_LIMIT_90 # send email notification when job reaches 90% of time limit
For more information or options on sbatch, check out the manual page by typing "man sbatch".
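As an illustrative sketch (not a prescription), several of the directives above can be combined in one script, here requesting one GPU, 8 cores, 50GB of RAM, and a 1-day time limit from the dgxh partition; the job name, module, and executable names are placeholders to adjust for your own work:
#!/bin/bash
#SBATCH -J myGpuJob # name of job
#SBATCH -p dgxh # partition with GPUs
#SBATCH --gres=gpu:1 # request one GPU
#SBATCH -c 8 # 8 cores/threads per task
#SBATCH --mem=50G # 50 gigabytes of RAM
#SBATCH -t 1-00:00:00 # 1 day time limit
#SBATCH -o myGpuJob.out # output file
#SBATCH -e myGpuJob.err # error file
#SBATCH --mail-type=END,FAIL # email when the job ends or fails
module load cuda # load any modules your application needs
./myGpuExecutable # run the application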
Here are the currently available node features that can be requested using the Slurm "--constraint" option:
#SBATCH --constraint=el8 # request node running RHEL8 based operating system
#SBATCH --constraint=el9 # request node running RHEL9 based operating system
#SBATCH --constraint=avx # request node with AVX instruction set
#SBATCH --constraint=avx2 # request node with AVX2 instruction set
#SBATCH --constraint=avx512 # request node with AVX512 instruction set
#SBATCH --constraint=haswell # request node with Intel Haswell CPU
#SBATCH --constraint=broadwell # request node with Intel Broadwell CPU
#SBATCH --constraint=skylake # request node with Intel Skylake CPU
#SBATCH --constraint=cascadelake # request node with Intel Cascadelake CPU
#SBATCH --constraint=epyc # request node with AMD EPYC CPU
#SBATCH --constraint=m60 # request node with M60 GPU
#SBATCH --constraint=t4 # request node with T4 GPU
#SBATCH --constraint=rtx6000 # request node with RTX6000 GPU
#SBATCH --constraint=rtx8000 # request node with RTX8000 GPU
#SBATCH --constraint=v100 # request node with V100 GPU
#SBATCH --constraint=a40 # request node with A40 GPU
#SBATCH --constraint=h100 # request node with H100 GPU
#SBATCH --constraint=h200 # request node with H200 GPU
#SBATCH --constraint=l40s # request node with L40S GPU
Note that when requesting a node with a specific GPU using the constraint option, you must still use the "--gres" option to request the GPU itself.
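For example, to land on an A40 node and actually be allocated one of its GPUs in a batch job, pair the two directives (the same combination works on the srun command line, as in the srun examples above):
#SBATCH -p preempt # or another partition containing A40 nodes
#SBATCH --constraint=a40 # require a node that has an A40 GPU
#SBATCH --gres=gpu:1 # allocate one of the GPUs on that node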
To cancel a job on the cluster:
scancel {jobid}
To cancel all of your jobs on the cluster:
scancel -u {ONID}
To cancel all of your jobs on one partition:
scancel -p {partition}
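These options can also be combined with standard scancel filters; for example, to cancel only your pending (not yet running) jobs in one partition:
scancel -u {ONID} -p {partition} --state=PENDING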
For more information or options on scancel, type "man scancel".
To view a list of queues and their status:
sinfo
For more information or options on sinfo, type "man sinfo".
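For example, sinfo can also report on a single partition, one line per node, in a longer format (standard sinfo options):
sinfo -p {partition} -N -l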
To view status and available resources (CPU, GPU, RAM) of all nodes in a partition:
nodestat {partition}
To view the list and status of all jobs in all queues:
squeue
-or-
sq
To view the list and status of all jobs in one partition or queue:
squeue -p {partition}
To view the list and status of your jobs:
squeue -u {ONID}
-or-
squ
For more information or options on squeue, type "man squeue".
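For example, to ask the scheduler for the estimated start times of your pending jobs (estimates only appear once the backfill scheduler has planned the job):
squeue -u {ONID} --start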
To view status of jobs in the queue but with an alternate, longer listing format:
sqe
To see more detailed information on a specific pending or running job:
showjob {jobid}
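If the showjob wrapper is unavailable in your environment, the standard Slurm command below prints a similarly detailed job record:
scontrol show job {jobid}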
Here is a list of partitions, their descriptions, the accounts that have access to them, and their node features (e.g. OS, GPU):
Partition | Description/Owner | Accounts | Features (e.g. OS, GPU)
share | Shared resources, open to all users | ALL | el8, el9, m60 |
class | Resources reserved for class use | class accounts | el8, el9, varies |
dgxh | DGX-H systems | ALL | el9, h100, h200 |
dgxh-ceoas | DGX-H systems for CEOAS | ceoas | el9, h100 |
dgx2 | DGX-2 systems | ALL | el9, v100 |
gpu | Shared GPU servers for EECS | eecs | el9, rtx8000 |
gpu-dmv | Resources for research group | dmv | el9, rtx8000 |
ampere | Resources for research group | ALL | el9, a40 |
athena | Resources for research group | hlab | el9, a40 |
bee | Shared resources for BEE | fc-lab, sg-ecohydro | el9, v100 |
bee1 | Resources for research group | fc-lab | el9, v100 |
cbee | Resources for research group | la-grp | el8 |
cp-grp | Resources for research group | cp-grp | el8 |
ecohydro | Resources for research group | sg-ecohydro | el9 |
eecs | Shared resources for EECS | eecs | el8, rtx2080 |
eecs2 | Resources for research group | virl-grp | el9, rtx6000 |
eecs3 | Resources for research group | rah-grp | el9, m60 |
hw-grp | Resources for research group | hw-grp | el9, h100 |
maml | Resources for research group | maml | el9 |
mime1 | Resources for research group | kt-lab | el8 |
mime2 | Resources for research group | jt-grp | el8 |
mime3 | Resources for research group | mime3_grp | el8 |
mime4 | Resources for research group | nrg | el8 |
mime5 | Resources for research group | simon-grp | el8 |
mime7 | Resources for research group | ba-grp | el8 |
nacse | Resources for NACSE | nacse | el9, a40 |
nerhp | Shared resources for NSE | nse | el8 |
nerhp2 | Resources for research group | ig-lab | el8 |
nse3 | Shared resources for research groups | cp-grp, tp-grp | el8 |
sail | Resources for research group | sail | el9, a40 |
soundbendor | Resources for research group | soundbendor | el9, rtx6000, t4 |
tiamat | Shared resources for research groups | dmf, virl-grp, af-lab, iras | el9, l40s |
tp-grp | Resources for research group | tp-grp | el8 |
preempt | Low priority queue with access to all resources, but subject to preemption by higher priority jobs | ALL | ALL |
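Because jobs in the preempt partition can be stopped by higher-priority jobs, it can be helpful (though not required) to ask Slurm to requeue a preempted job automatically and append to, rather than overwrite, its output files, for example:
#SBATCH -p preempt
#SBATCH --requeue # put the job back in the queue if it is preempted
#SBATCH --open-mode=append # append to the output/error files when the job restarts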
Here is a list of some partitions and their time and resource limits:
Partition | Time Limit | CPU cores / CPU time | GPUs / GPU time | RAM / GB time
share | 7 days | 512 / 512 CPU-days = 512 CPUs/day, or 256 CPUs/2 days, or 128 CPUs/4 days, or 16 CPUs/week | none | none
class | varies | varies | varies | varies
dgxh | 2 days | 224 / 64 CPU-days = 32 CPUs/2 days, or 64 CPUs/1 day, or 128 CPUs/12 hrs, or 224 CPUs/6 hrs | 8 / 2 GPU-days = 1 GPU/2 days, or 2 GPUs/1 day, or 4 GPUs/12 hrs, or 8 GPUs/6 hrs | 2000 GB / 500 GB-days = 250 GB/2 days, or 500 GB/1 day, or 1000 GB/12 hrs, or 2000 GB/6 hrs
dgx2 | 7 days | 96 / 48 CPU-days = 6 CPUs/week, or 12 CPUs/4 days, or 24 CPUs/2 days, or 48 CPUs/1 day, or 96 CPUs/12 hrs | 16 / 8 GPU-days = 1 GPU/week, or 2 GPUs/4 days, or 4 GPUs/2 days, or 8 GPUs/1 day, or 16 GPUs/12 hrs | 1500 GB / 750 GB-days = 93 GB/week, or 187 GB/4 days, or 375 GB/2 days, or 750 GB/1 day, or 1500 GB/12 hrs
gpu | 7 days | 48 | 8 | 750 GB
ampere | 7 days | 64 / 64 CPU-days = 32 CPUs/2 days, or 64 CPUs/1 day | 2 / 2 GPU-days = 1 GPU/2 days, or 2 GPUs/1 day | 250 GB / 250 GB-days = 125 GB/2 days, or 250 GB/1 day
eecs | 7 days | 40 / 48 CPU-days = 6 CPUs/week, or 12 CPUs/4 days, or 24 CPUs/2 days, or 40 CPUs/1 day | 8 / 8 GPU-days = 1 GPU/week, or 2 GPUs/4 days, or 4 GPUs/2 days, or 8 GPUs/1 day | 250 GB / 250 GB-days = 31 GB/week, or 62 GB/4 days, or 125 GB/2 days, or 250 GB/1 day
preempt | 7 days | none | none | none
The CPU/GPU/GB times are the maximum amount of CPU, GPU, and memory (GB) time that your jobs can have allocated at any one time.
The total number of jobs that one user can submit to all partitions at once is 1000.
The total number of jobs that one user can have running in all partitions at once is 400.
For more information on using Slurm, visit the website below:
https://slurm.schedmd.com/