Cell Analysis Import¶
Overview¶
Import platform‑specific cell analysis outputs (cells, boundaries, clusters/DE), convert them to web‑ready layers (PMTiles/GeoJSON), and optionally register them in a CartLoader catalog. Supported platforms include 10x Xenium and 10x Visium HD.
Requirements¶
- Cell analysis outputs from the platform (see per‑platform sections).
- For PMTiles conversion:
tippecanoeinstalled and on PATH. - Optional catalog update: an existing
catalog.yamlfrom Asset Packaging.
Example Usages¶
Two Input Modes
CartLoader accepts two input modes for each platform:
- JSON mode: pass
--in-jsonwith required keys. - Manual mode: pass
--in-dirwith specific file locations.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | |
Actions¶
Action Specifications
No action runs by default. Activate at least one using the action parameters.
Cell Conversion Step (--cells)¶
Convert cell points/centroids, clusters, and differential expression results to PMTiles with counts and top cluster index (topK). A JSON summary of inputs/outputs and basic stats will be written.
Cell Boundary Conversion Step (--boundaries)¶
Convert cell polygons to GeoJSON with cell_id and topK attributes. A JSON summary of inputs/outputs and basic stats will be written.
Catalog Update (--update-catalog)¶
Append generated layers to an existing catalog.yaml.
UMAP Conversion Step (--umap)¶
Convert UMAP embeddings to PMTiles.
Parameters¶
Action Parameters¶
--cells(flag): Import segmented cells and generate PMTiles.--boundaries(flag): Import segmented cell boundaries and generate GeoJSON/PMTiles.--umap(flag): Import UMAP projection and generate PMTiles.--update-catalog(flag): Update an existing catalog.yaml.--all(flag): Enable all actions (--cells,--boundaries,--umap).
Input/Output Parameters¶
--outprefix(str, required): Output prefix (e.g.,/path/to/out/sample).--id(str): Identifier for the cell factor (default: basename of--outprefix).--name(str): Display name for the cell factor (default: basename of--outprefix).--tmp-dir(str): Temporary directory for intermediate files.
JSON Mode:
- --in-json (str): Path to JSON with keys CELL, BOUNDARY, CLUSTER, DE, UMAP_PROJ.
Manual Mode (relative to --in-dir):
- --in-dir (str): Base input directory.
- --csv-cells (str): Cell table (CSV/Parquet) (default: cells.csv.gz).
- --csv-boundaries (str): Cell boundaries (CSV) (default: cell_boundaries.csv.gz).
- --csv-clust (str): Clustering results (CSV) (default: analysis/clustering/gene_expression_graphclust/clusters.csv).
- --csv-diffexp (str): Differential expression (CSV) (default: analysis/diffexp/gene_expression_graphclust/differential_expression.csv).
- --csv-umap (str): UMAP projection (CSV) (default: analysis/pca/gene_expression_10_components/projection.csv).
JSON Mode:
- --in-json (str): Path to JSON with keys CELL_FEATURE_MEX, CELL_GEOJSON, CLUSTER, DE, UMAP_PROJ.
Manual Mode (relative to --in-dir):
- --in-dir (str): Base input directory.
- --mtx-cells (str): Cell-feature matrix directory (default: segmented_outputs/filtered_feature_cell_matrix).
- --geojson-cells (str): Cell segmentations (GeoJSON) (default: segmented_outputs/cell_segmentations.geojson).
- --csv-clust (str): Clustering results (CSV) (default: analysis/clustering/gene_expression_graphclust/clusters.csv).
- --csv-diffexp (str): Differential expression (CSV) (default: analysis/diffexp/gene_expression_graphclust/differential_expression.csv).
- --csv-umap (str): UMAP projection (CSV) (default: analysis/pca/gene_expression_10_components/projection.csv).
- --scale-json (str): Scale factors JSON (default: spatial/scalefactors_json.json, or None).
- --units-per-um (float): Coordinate units per µm (default: 1).
Auxiliary Parameters
PMTiles Conversion:
- --min-zoom (int, default: 10): Min zoom for cells/boundaries.
- --max-zoom (int, default: 18): Max zoom for cells/boundaries.
- --umap-min-zoom (int, default: 0): Min zoom for UMAP.
- --umap-max-zoom (int, default: 18): Max zoom for UMAP.
- --max-tile-bytes (int, default: 5000000): Max bytes per tile.
- --max-feature-counts (int, default: 500000): Max features per tile.
- --preserve-point-density-thres (int, default: 1024): Point density threshold.
Coloring and DE:
- --tsv-cmap (str): Color table TSV (default: assets/fixed_color_map_60.tsv).
- --de-max-pval (float, default: 0.01): Max p-value.
- --de-min-fc (float, default: 1.2): Min fold change.
Catalog:
- --catalog-yaml (str): Input catalog to update.
- --out-catalog-yaml (str): Output catalog path (if different from input).
Column Names (Collapsible):
Xenium Only:
- --cells-colname-cell-id (str, default: cell_id)
- --cells-colname-x (str, default: x_centroid)
- --cells-colname-y (str, default: y_centroid)
- --cells-colname-count (str, default: transcript_counts)
- --boundaries-colname-cell-id (str, default: cell_id)
- --boundaries-colname-x (str, default: vertex_x)
- --boundaries-colname-y (str, default: vertex_y)
Common/Both:
- --clust-colname-barcode (str, default: Barcode)
- --clust-colname-cluster (str, default: Cluster)
- --umap-colname-barcode (str, default: Barcode)
- --umap-colname-x (str, default: UMAP-1)
- --umap-colname-y (str, default: UMAP-2)
Environment:
- --tippecanoe (str): Path to tippecanoe binary.
- --parquet-tools (str): Path to parquet-tools (Xenium only).
- --R (str): Path to R/Rscript binary.
- --threads (int): Number of threads.
- --log (flag): Enable logging.
- --log-suffix (str): Log file suffix.
Outputs¶
- Cells PMTiles (
*.cells.pmtiles): cell centroids/points with attributescell_id,count,topK. - Boundaries GeoJSON (
*.boundaries.geojson): cell polygons withcell_id,topK. - Summary TSV/JSON (when
--summary).