Sample submission file for the ingestion of spatial data

The spatial transcriptomics ingestion workflow requires a sample submission file that specifies the location of the input files. The sample submission file is a tab-separated file with one row per sample. Panpipes currently supports the ingestion of Visium and Vizgen data.

The 6 columns of the sample submission file are:

sample id: Unique sample ID.

spatial_path: The root directory containing the data files. Please note, that the folder structure of the root directory needs to be structured as expected by the squidpy.read.visium (for Visium data) or squidpy.read.vizgen (for Vizgen data) functions.

spatial_filetype: Either “vizgen” or “visium”.

spatial_counts: The count matrix file. Usually filtered_feature_bc_matrix.h5 or raw_feature_bc_matrix.h5 for a Visium dataset. For Vizgen inputs, this file typically ends with _cell_by_gene.csv.

spatial_metadata: The metadata csv-file for Vizgen data. Leave empty for Visium data.

spatial_transformation: The transformation csv-file for Vizgen data. This column is optional for Vizgen data. Leave empty for Visium data.

Note, that the columns, sample_id, spatial_path, spatial_filetype, and spatial_counts are required for both, Visium and Vizgen data. The spatial_metadata(required) and spatial_transformation(optional) columns are Vizgen-specific and should be left empty for Visium data.

Example submission file

sample_id

spatial_path

spatial_filetype

spatial_counts

spatial_metadata

spatial_transformation

V1_Human_Heart

./data_visium/V1_Human_Heart

visium

V1_Human_Heart_filtered_feature_bc_matrix.h5

V1_Human_Lymph_Node

./data_visium/V1_Human_Lymph_Node

visium

V1_Human_Lymph_Node_filtered_feature_bc_matrix.h5

Mouse_Brain

./data_vizgen

vizgen

cell_by_gene_S1R1.csv

cell_metadata_S1R1.csv

images_micron_to_mosaic_pixel_transform.csv