cluster:usage_tips:startslurm

This is an old revision of the document!


Slurm is a workload manager,scheduler system. It's build from computer nodes, 1 node is the Login Node, a master node and worker node's
The login node is called cheetah.sleep.ru.is 130.208.209.30.
The cluster is a linux environment, connect with SSH

Open a terminal to log-in ssh <myUser>@cheetah.sleep.ru.is
It's possible to connect from Rivendell 130.208.209.30 (and other sleep server) with any editor SSH

Each user has his home folder, the path is [/mount/home/<username>]
The home folder should have 2 files, a readMe.lst file and template (slurm) script file (exampleJob.sh)

  1. The first step for users is to make sure their working environment fits their needs.
    For those who are using Python, we recommend, for ease of use, using a Python virtual environment
    User can then install different version of Python and all necessary modules for their work.
  2. However user can setup and install their Python modules locally without a Python virtual environment, by using Pip3 install <Module>
  3. The cluster gives users access to memory, cpu and GPU. The script file has instructing the slurm cluster what to do
  4. Users can change the slurm instructions in the script file, but it's not necessary for it to work, just append your command to it.
  5. A example of how to execute your job on the cluster; first create a file prufa.py in your home directory, append the line python3 prufa.py to exampleJob.sh
    Then submit your job to the queue with command sbatch exampleJob.sh


Your template script file (exampleJob.sh) looks like this
#!/bin/bash
#SBATCH --account=staff
#SBATCH --job-name=sleepJob
#SBATCH --gpus-per-node=1
#SBATCH --mem-per-cpu=2G
#SBATCH --output=Slurm.log

Useful commands

cmdDescr
sbatch <exampleJob.sh>Submit job
sacctMy jobs
squeueShow all jobs on the queue
sinfoInformation about the cluster
srunRun job interactively
scancel <id>kill job with <id>

Example of Python virtual environment

Python venv
Miniconda
Aaconda

Advanced tips

  • cluster/usage_tips/startslurm.1754579397.txt.gz
  • Last modified: 2025/08/07 15:09
  • by bjorn