Actions

SLURM-Partition

From ALICE Documentation

Revision as of 15:08, 9 September 2020 by Schulzrf (talk | contribs) (Partition)

Partition

Slurm organises the resources in a cluster in so-called partitions and jobs are always submitted to either a default partition or a user-specified partition.

The command sinfo lists the available partitions, their state and resources.

Currently, partitions on ALICE differ primarily in terms of the available nodes and time limit for a job:

Partition Timelimit Nodes Nodelist Description
testing 1:00:00 2 nodelogin[01-02] For some basic and short testing of batch scripts
cpu-short 3:00:00 20 nodes[001-020] For jobs that require CPU nodes and not more than 3h of running time. This is the default partition
cpu-medium 1-00:00:00 20 nodes[001-020] For jobs that require CPU nodes and not more than 1d of running time
cpu-long 7-00:00:00 20 nodes[001-020] For jobs that require CPU nodes and not more than 7d of running time
gpu-short 3:00:00 10 nodes[851-860] For jobs that require GPU nodes and not more than 3h of running time
gpu-medium 1-00:00:00 10 nodes[851-860] For jobs that require GPU nodes and not more than 1d of running time
gpu-long 7-00:00:00 10 nodes[851-860] For jobs that require GPU nodes and not more than 7d of running time
mem inifinite 1 nodes801 For jobs that require the high memory node. There is no time limit for this partition
notebook-cpu infinite 5 nodes[001-005] For interactive jobs that require CPU nodes. There is no time limit for this partition
notebook-gpu infinite 2 nodes[851-852] For interactive jobs that require GPU nodes. There is no time limit for this partition
Partition Timelimit Nodes Nodelist Description
testing 1:00:00 2 nodelogin[01-02] For some basic and short testing of batch scripts
cpu-short 3:00:00 20 nodes[001-020] For jobs that require CPU nodes and not more than 3h of running time. This is the default partition
cpu-medium 1-00:00:00 20 nodes[001-020] For jobs that require CPU nodes and not more than 1d of running time
cpu-long 7-00:00:00 20 nodes[001-020] For jobs that require CPU nodes and not more than 7d of running time
gpu-short 3:00:00 10 nodes[851-860] For jobs that require GPU nodes and not more than 3h of running time
gpu-medium 1-00:00:00 10 nodes[851-860] For jobs that require GPU nodes and not more than 1d of running time
gpu-long 7-00:00:00 10 nodes[851-860] For jobs that require GPU nodes and not more than 7d of running time
mem inifinite 1 nodes801 For jobs that require the high memory node. There is no time limit for this partition
notebook-cpu infinite 5 nodes[001-005] For interactive jobs that require CPU nodes. There is no time limit for this partition
notebook-gpu infinite 2 nodes[851-852] For interactive jobs that require GPU nodes. There is no time limit for this partition

In your batch script, you can use the following command to set the partition you need:

--partition=<partition-name>