Skip to content

(Optional) Snakemake with SLURM

Tip

We recommended to integrate SLURM scheduler with NovaScope to automate the process of job submission.

Info

Please be aware that Snakemake introduced significant updates for cluster configuration starting from version 8. So, please verify your Snakemake version using snakemake --version.

In NovaScope, we utilized a cluster configuration profile to define the details of the cluster and resources given its consistency and time-saving benefits (see details below). Those files were crafted with inspiration from the smk-simple-slurm repository.

A Cluster Configuration File for Snakemake v7.29.0

Create a config.yaml with the following settings. Please substitute the placeholders, which are marked with <>, to suit your specific case. An example file is provided at slurm/v7.29.0/config.yaml.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
## Cluster Configuration
## The following setting also aids in organizing log files by creating rule-specific subdirectories within the job's log directory, each holding its own output and error files.
cluster:
  mkdir -p logs/{rule}/ &&
  sbatch
    --job-name={rule}_{wildcards}
    --output=logs/{rule}/{rule}___{wildcards}___%j.out
    --error=logs/{rule}/{rule}___{wildcards}___%j.err
    --account={resources.account}
    --partition={resources.partition}
    --mem={resources.mem}
    --time={resources.time}
    --cpus-per-task={threads}
    --parsable
    --nodes={resources.nodes}

## Default Resources for Jobs
default-resources:
  - partition=<your_default_partition>    # replace <your_default_partition> with your actual partition name
  - mem=<default_memory_allocation>       # replace <default_memory_allocation> with memory, e.g., "4G"
  - time=<default_time_limit>             # replace <default_time_limit> with time, e.g., "01:00:00"
  - nodes=<default_number_of_nodes>       # replace <default_number_of_nodes> with nodes, e.g., "1"
  - account=<default_account_information> # replace <default_account_information> with your account info

## General Snakemake Settings
jobs: <max_number_of_jobs>                  # replace <max_number_of_jobs> with your desired maximum number of concurrent jobs, e.g., 10
latency-wait: <latency_seconds>             # replace <latency_seconds> with the number of seconds to wait if job output is not present, e.g., 120
local-cores: <local_core_count>             # replace <local_core_count> with the max number of cores to use locally, e.g., "20"
restart-times: <restart_attempts>           # replace <restart_attempts> with the number of times to retry failing jobs, e.g., "0" for no retries
max-jobs-per-second: <job_submission_rate>  # replace <job_submission_rate> with the limit on how many jobs can be submitted per second, e.g., "20"
keep-going: <continue_after_failure>        # replace <continue_after_failure> with True or False to indicate whether to continue executing other jobs after a failure
rerun-incomplete: <rerun_incomplete_jobs>   # replace <rerun_incomplete_jobs> with True or False to decide if incomplete jobs should be rerun
printshellcmds: <print_commands>            # replace <print_commands> with True or False to specify if shell commands should be printed before execution

## Scheduler Settings
#scheduler: greedy      

## Conda Environment Settings
use-conda: <True_or_False>               # enable use of Conda environments
conda-frontend: conda                    # specify Conda as the package manager frontend

A Cluster Configuration file for Snakemake v8.6.0

Please first install the Snakemake executor plugin "cluster-generic":

1
pip install snakemake-executor-plugin-cluster-generic

Then, create the cluster configuration file with below. Please substitute the placeholders, which were marked with <>, to suit your specific case. Please see our example file at slurm/v8.6.0/config.yaml.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
## Cluster Configuration
## The following setting also aids in organizing log files by creating rule-specific subdirectories within the job's log directory, each holding its own output and error files.
executor: "cluster-generic"
cluster-generic-submit-cmd: "mkdir -p logs/{rule}/ &&
  sbatch
    --job-name={rule}_{wildcards} \
    --output=logs/{rule}/{rule}___{wildcards}___%j.out \
    --error=logs/{rule}/{rule}___{wildcards}___%j.err \
    --partition={resources.partition} \
    --mem={resources.mem} \
    --time={resources.time} \
    --cpus-per-task={threads} \
    --parsable \
    --nodes={resources.nodes} "

## Default Resources for Jobs
default-resources:
  - partition=<your_default_partition>    # replace <your_default_partition> with your actual partition name
  - mem=<default_memory_allocation>       # replace <default_memory_allocation> with memory, e.g., "4G"
  - time=<default_time_limit>             # replace <default_time_limit> with time, e.g., "01:00:00"
  - nodes=<default_number_of_nodes>       # replace <default_number_of_nodes> with nodes, e.g., "1"
  - account=<default_account_information> # replace <default_account_information> with your account info

## General Snakemake Settings
jobs: <max_number_of_jobs>                  # replace <max_number_of_jobs> with your desired maximum number of concurrent jobs, e.g., 10
latency-wait: <latency_seconds>             # replace <latency_seconds> with the number of seconds to wait if job output is not present, e.g., 120
local-cores: <local_core_count>             # replace <local_core_count> with the max number of cores to use locally, e.g., "20"
restart-times: <restart_attempts>           # replace <restart_attempts> with the number of times to retry failing jobs, e.g., "0" for no retries
max-jobs-per-second: <job_submission_rate>  # replace <job_submission_rate> with the limit on how many jobs can be submitted per second, e.g., "20"
keep-going: <continue_after_failure>        # replace <continue_after_failure> with True or False to indicate whether to continue executing other jobs after a failure
rerun-incomplete: <rerun_incomplete_jobs>   # replace <rerun_incomplete_jobs> with True or False to decide if incomplete jobs should be rerun
printshellcmds: <print_commands>            # replace <print_commands> with True or False to specify if shell commands should be printed before execution

## Conda Environment Settings
software-deployment-method: conda

Additional Functions

If you encounter situations where your jobs fail without any warnings, it is feasible to using a cluster status script to keep track of your jobs. See details here.