Process Vizgen MERSCOPE data¶
Locate the transcript file from your Vizgen MERSCOPE output, most likely named as detected_transcripts.csv.gz
.
The first few lines may look like (from the public Human lung cancer 1 FFPE dataset)
We will collapse pixels from all z-planes to 2D, essentially using only global_x
, global_y
, and transcript_id
.
The following command assume your input is in inpath
and you you want to store output to path
. In some data the negative control proves are names "Blank-*", --dummy_genes Blank
will regard any transcript_id
containing the substring "Blank" as control probes.
The script foramt_Vizgen.py
can be found in misc/
in the FICTURE repository.
iden=mouselung # Set how you want to call your dataset
input=${inpath}/detected_transcripts.csv.gz
output=${path}/filtered.matrix.${iden}.tsv
feature=${path}/feature.clean.${iden}.tsv.gz
coor=${path}/coordinate_minmax.tsv
python foramt_Vizgen.py --input ${input} --output ${output} --feature ${feature} --coor_minmax ${coor} --precision 2 --dummy_genes Blank
sort -S 4G -k1,1g ${output} | gzip -c > ${output}.gz
rm ${output}