Rule sdgeAR_reformat
:¶
Purpose¶
Reformat the spatial digital gene expression (SGE) matrix from the 10x Genomics format to a FICTURE-compatible TSV format.
Input Files¶
- Spatial Digital Gene Expression (SGE) Matrix and its Metadata File for Coordinates
Required input files include a SGE matrix and its meta file for X Y coordinates. Those files are required to be stored in the
sgeAR
subfolder in theanalysis
directory. This could be generated by Rulesdge2sdgeAR
or manually prepared by the users.
Output Files¶
The rule generates the following output in the specified directory path:
1 |
|
(1) An SGE in a FICTURE-compatible Format¶
Description: A transcript-indexed SGE in the FICTURE format is generated, which contains all information including the barcode information, features information, and count for each genomic feature.
File Naming Convention:
1 |
|
File Format:
1 2 3 4 |
|
#lane
: lane IDtile
: tile IDX
: X-coordinateY
: Y-coordinategene_id
: Gene Ensemble IDgene
: Gene symbolgn
: the count per gene per barcode for Genegt
: the count per gene per barcode for GeneFullspl
: the count per gene per barcode for Splicedunspl
: the count per gene per barcode for Unsplicedambig
: the count per gene per barcode for Ambiguous
(2) A Tab-delimited Feature File¶
Description: This include a feature file (*.feature.tsv.gz
) that counts UMIs for each gene.
File Naming Convention:
1 |
|
File Format:
1 2 3 4 |
|
gene_id
: Gene Ensemble IDgene
: Gene symbolgn
: the count per gene per barcode for Genegt
: the count per gene per barcode for GeneFullspl
: the count per gene per barcode for Splicedunspl
: the count per gene per barcode for Unsplicedambig
: the count per gene per barcode for Ambiguous
Output Guidelines¶
The output file could be used as the input for FICTURE.
Parameters¶
No additional parameter is applied in this rule.
Dependencies¶
Rule sdgeAR_reformat
executes only after sdge2sdgeAR
and its prerequisites are completed. See the Workflow Structure for dependencies.
Code Snippet¶
The code for this rule is provided in c02_sdgeAR_reformat.smk
.