Labels

Sunday, November 15, 2020

Linux resource management tool Slurm

Slurm is a group utilities used for managing workloads on computer clusters.

Some clusters use Slurm as the batch queuing system and the scheduling mechanism.

Jobs are submitted to Slurm from a login node and Slurm handles scheduling these jobs on nodes.

sinfo -l command displays information about nodes and partitions.

sbatch job.sh submit a job where job.sh is the script you want to run.

scancel jobID cancels a job

squeue -u userID shows state of jobs


An example for job submit script, e.g., job.slurm 


Execute script

$ sbatch job.slurm


two examples interactive shell
$srun -n 1 --pty bash -i

$srun --mem=4096 --pty  /bin/bash --time=01:00:00

No comments:

Post a Comment