Singularity Example
Running NovaScope with the pre-built Docker image through Singularity requires a system that has Singularity installed. Unlike Docker, Singularity does not necessarily require root permission and is often available in HPC environments. We still recommend a system with more than 32 GB of memory and more than 64 GB of disk space.
Running NovaScope using Singularity involves four main steps: (1) setting up Singularity on your system, (2) downloading example data and reference files, (3) pulling the Docker image and converting it to a Singularity image, and (4) running the pipeline. We will assume that /path/to/working/dir is your working directory. We will use the minimal test run data as an example, but you may replace it with other datasets.
Setting up Singularity in your system
If you are new to Singularity, please refer to the Singularity documentation for installation and basic usage. To check that Singularity is installed on your system, you can run the following command:
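A minimal check could look like the sketch below. It assumes the executable is named singularity and is on your PATH; the version string it prints depends on your installation.

```shell
# Report the Singularity version if the executable is on PATH;
# otherwise say that it is missing.
if command -v singularity >/dev/null 2>&1; then
    singularity --version
    singularity_found=yes
else
    echo "singularity was not found on PATH" >&2
    singularity_found=no
fi
```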
On some systems, you may need to load the Singularity module before running the above command. For example, on an HPC cluster, you may need to run the following commands:
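As a sketch, assuming your cluster uses Environment Modules or Lmod and that the module is simply called singularity (the name and any version suffix are site-specific, so check the output of module avail first):

```shell
# Assumption: the module is named "singularity"; override SINGULARITY_MODULE
# if your site uses a different name or a versioned name.
SINGULARITY_MODULE=${SINGULARITY_MODULE:-singularity}

if command -v module >/dev/null 2>&1; then
    # Load the module, then confirm that the executable is now available.
    module load "$SINGULARITY_MODULE" && singularity --version
else
    echo "no 'module' command in this shell; skipping module load" >&2
fi
```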
If the above commands fail, you need to install Singularity on your system. Currently, our images are built for the Intel/AMD x86_64 architecture.
Download example data and reference files
You may download the minimal test run data and STAR mouse reference index by running the following commands:
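The download steps can be sketched as below. The two URLs are placeholders, not real links: replace them with the download links given in the NovaScope documentation. The helper name fetch_and_extract is hypothetical, and the sketch assumes that curl is available and that both downloads are gzip-compressed tar archives.

```shell
WORKDIR=${WORKDIR:-/path/to/working/dir}

# Placeholders (hypothetical): replace with the download links listed in the
# NovaScope documentation before calling fetch_and_extract.
TESTRUN_URL="<URL of the minimal test run data archive>"
STAR_INDEX_URL="<URL of the STAR mouse reference index archive>"

# Download one .tar.gz archive into a directory and unpack it there.
fetch_and_extract() {
    url=$1
    dest=$2
    mkdir -p "$dest" || return 1
    archive="$dest/$(basename "$url")"
    # -f: fail on HTTP errors, -L: follow redirects
    curl -fL -o "$archive" "$url" || return 1
    tar -xzf "$archive" -C "$dest"
}

# Usage, once the URLs are filled in:
#   fetch_and_extract "$TESTRUN_URL"    "$WORKDIR"
#   fetch_and_extract "$STAR_INDEX_URL" "$WORKDIR"
```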
Pulling the Docker image and converting it to a Singularity image
You will need to pull the docker image and convert it to a singularity image. You can do this by running the following command:
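A sketch of this step, using singularity pull with a docker:// reference, which downloads the Docker image and converts it in one go. The Docker namespace is left as a placeholder because it must come from the NovaScope documentation; the helper names default_sif_name and pull_novascope_image are hypothetical.

```shell
# Default file name written by "singularity pull" for a docker:// reference:
# the last path component, with ":" between name and tag replaced by "_".
default_sif_name() {
    ref=${1##*/}                        # "name:tag" without registry/namespace
    case $ref in
        *:*) ;;                         # tag given explicitly
        *)   ref="$ref:latest" ;;       # Docker's default tag
    esac
    printf '%s_%s.sif\n' "${ref%%:*}" "${ref#*:}"
}

# Pull the image into the current directory. The argument is the Docker
# repository, e.g. "<namespace>/novascope"; <namespace> is a placeholder for
# the account named in the NovaScope documentation.
pull_novascope_image() {
    singularity pull "docker://$1:latest"
}

# Usage:
#   cd /path/to/working/dir
#   pull_novascope_image "<namespace>/novascope"
#   ls -lh "$(default_sif_name "<namespace>/novascope:latest")"
```

The naming helper only illustrates why a repository named novascope with the tag latest ends up as novascope_latest.sif; you can also choose the file name yourself by passing it to singularity pull before the docker:// reference.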
If successful, this will create a file named novascope_latest.sif in your working directory.
Running the NovaScope pipeline
You may perform a dry-run to test whether the NovaScope pipeline with the minimal test example data is working properly.
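A dry-run could be sketched as follows. The Snakefile path inside the container and the job configuration path are placeholders to be taken from the NovaScope documentation, and the sketch assumes that snakemake is on the PATH inside the image. The helper name novascope_snakemake and the PREVIEW switch are hypothetical conveniences, not part of NovaScope.

```shell
WORKDIR=${WORKDIR:-/path/to/working/dir}
SIF=${SIF:-$WORKDIR/novascope_latest.sif}

# Hypothetical placeholders: set these to the Snakefile path inside the
# container and to the test-run job configuration file.
SNAKEFILE=${SNAKEFILE:-/path/to/Snakefile/in/container}
CONFIGFILE=${CONFIGFILE:-$WORKDIR/path/to/job/config.yaml}

# Run Snakemake inside the container; extra arguments are passed through.
# With PREVIEW=1 the command line is printed instead of executed.
novascope_snakemake() {
    ${PREVIEW:+echo} singularity exec \
        --bind "$WORKDIR:$WORKDIR" \
        "$SIF" \
        snakemake \
        --snakefile "$SNAKEFILE" \
        --configfile "$CONFIGFILE" \
        "$@"
}

# Usage: list the planned jobs and their shell commands without running them.
#   novascope_snakemake --cores 1 --dry-run -p
```

Binding the working directory to the same path inside the container keeps the paths in the configuration file valid on both sides.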
If the dry-run is successful, you may run the pipeline with the following command:
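The full run differs from a dry-run only in dropping the --dry-run flag. The sketch below is self-contained and again uses placeholder paths for the Snakefile inside the container and for the job configuration file; the helper name run_novascope is hypothetical, and snakemake is assumed to be on the PATH inside the image.

```shell
WORKDIR=${WORKDIR:-/path/to/working/dir}

# Hypothetical placeholders, to be replaced with the real paths.
SNAKEFILE=${SNAKEFILE:-/path/to/Snakefile/in/container}
CONFIGFILE=${CONFIGFILE:-$WORKDIR/path/to/job/config.yaml}

# Execute the pipeline inside the container.
# $1: number of CPU cores Snakemake may use (default: 1).
run_novascope() {
    cores=${1:-1}
    singularity exec \
        --bind "$WORKDIR:$WORKDIR" \
        "$WORKDIR/novascope_latest.sif" \
        snakemake \
        --snakefile "$SNAKEFILE" \
        --configfile "$CONFIGFILE" \
        --cores "$cores" \
        --rerun-incomplete -p
}

# Usage:
#   run_novascope 4
```

The --rerun-incomplete flag asks Snakemake to redo jobs whose outputs were left unfinished by an interrupted run, which is convenient when restarting after a failure.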
The pipeline will generate the output files in the /path/to/working/dir/output directory, typically within about 10 minutes.
To run NovaScope with different datasets, you will need to modify the Configuration File yourself and pass it with the --configfile argument in the command. See detailed instructions in the Job Configuration section.