Issue matching cells with their metadata from SEA-AD spatial transcriptomics datasets

I hope this message finds you well. I am reaching out about a potential discrepancy in the SEA-AD MERFISH spatial transcriptomics datasets. It is not clear to us how to match cells between the H5AD object, which holds the count matrix and cell-level metadata, and the per-sample files, which list the detected transcripts from the MERFISH data for each tissue sample.

In the uploaded files with cellpose-based detected transcripts for each tissue sample, such as (AWS S3 Explorer), there is a cell_id column that identifies each cell with an integer (e.g., 1117161400100099968).
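For anyone comparing IDs across files, here is a minimal sketch of reading the cell_id column without risking precision loss. These IDs are 19 digits long, beyond the range in which float64 represents every integer exactly, so the sketch keeps them as strings. The two-row CSV and the `gene` column are hypothetical stand-ins; only the `cell_id` column name comes from the file described above.

```python
import io

import pandas as pd

# Hypothetical excerpt of a detected-transcripts file. Only the cell_id
# column name is taken from the description above; the rest is made up.
csv_text = "gene,cell_id\nGENE_A,1117161400100099968\nGENE_B,\n"

# Read cell_id as text so pandas never coerces the column to float64,
# which it would otherwise do when some rows have no cell assigned.
transcripts = pd.read_csv(io.StringIO(csv_text), dtype={"cell_id": str})

# Unique cell IDs, skipping transcripts that were not assigned to a cell.
cell_ids = transcripts["cell_id"].dropna().unique().tolist()
print(cell_ids)
```

Keeping the IDs as strings also makes a later join against another table's identifier column straightforward, provided that table stores the same IDs.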

However, in the uploaded H5AD file (AWS S3 Explorer), the metadata slot ($obs) contains no column that refers to the cell_id. Instead, there is a column named sample_id whose entries denote cells using nucleotide barcodes (e.g., TGTAAAGCACATTAAC-L8XR_210805_01_H09-1124629228). This is confusing, as we expected the cell_ids for the MERFISH data to be in a numeric format (as in the cellpose-detected_transcripts.csv file) rather than a barcode format. Is it possible that the uploaded MERFISH metadata file is actually a 10x-based metadata file?
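The format mismatch described above can be checked with a small sketch like the following. The two patterns are assumptions inferred only from the two example identifiers in this post, not an official specification of either ID scheme, and `id_format` is a hypothetical helper name.

```python
import re

# Assumed shapes, inferred only from the two example IDs quoted above.
NUMERIC_ID = re.compile(r"\d+")         # e.g. 1117161400100099968
BARCODE_ID = re.compile(r"[ACGT]+-.+")  # nucleotide barcode plus a suffix


def id_format(value: str) -> str:
    """Classify an identifier as 'numeric', 'barcode', or 'unknown'."""
    if NUMERIC_ID.fullmatch(value):
        return "numeric"
    if BARCODE_ID.fullmatch(value):
        return "barcode"
    return "unknown"


examples = [
    "1117161400100099968",
    "TGTAAAGCACATTAAC-L8XR_210805_01_H09-1124629228",
]
formats = {value: id_format(value) for value in examples}
print(formats)
```

Applying the same check to every entry of an identifier column gives a quick indication of whether a metadata table uses segmentation-style integer IDs or sequencing-style barcodes.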

Could you please let us know whether there is a different file or system we should use to link the MERFISH cells and transcripts in the uploaded tissue-sample files with the metadata for those same cells in the .h5ad file?

Thanks!

Hi @tson, thanks for your interest in the data.

You’re right: that h5ad file is the single-nucleus data. The MERSCOPE spatial data is currently located here. These locations are confusing, and we’ll be changing them when we update these files soon.

However, our current pipeline for this data doesn’t track the cell IDs from the segmentation results through to the aggregated anndata object. We may be able to re-link these IDs before we update the h5ad file; I’ll post to this thread when that update happens.