I hope this message finds you well. I am reaching out to bring to your attention a potential discrepancy we have observed related to the SEA-AD MERFISH spatial transcriptomics datasets. Specifically, it is not clear to us how to identify cells between the H5AD object which summarizes the count matrix and cell-level metadata with the individual tissue-sample files which indicate detected transcripts for each molecule from the MERFISH data from individual tissue samples.
Specifically, in the uploaded files with cellpose-based detected transcripts for each tissue sample, such as (AWS S3 Explorer), we noticed that there is a column containing the cell_id for each cell that is identified via an integer (e.g., 1117161400100099968).
However, in the uploaded H5AD file (AWS S3 Explorer), the metadata slot ($obs) contains no column that refers to the cell_id. Instead, there is a column named sample_id with entries denoting cells represented usingnucleotide barcodes (e.g.,
TGTAAAGCACATTAAC-L8XR_210805_01_H09-1124629228). This is confusing to us as our expectation is that the cell_ids for the MERFISH data would be in some numeric format (as in the cellpose-detected_transcripts.csv file) and not a barcode format. Is it possible that the uploaded MERFISH metadata file corresponds to a 10x-based metadata file?
Can you please let us know if there is a different file or system we should be using to link the information on MERFISH based individual cells and transcripts from the uploaded tissue sample files with the metadata on these same cells uploaded from the .h5ad file?