Normalizing imputed spatial dataset

wykpenguin · April 21, 2026, 3:25pm

The inputed spatial transcriptomics data is only available in log2 transformed form. Some downstream analysis requires normalized data. What is the best way of doing this? Can I get back to the raw counts by 2^(log2 values)?

jeremyinseattle · April 22, 2026, 3:29pm

Hi @wykpenguin ,

According to the scientists who did this analysis from the original publication:

“We used the log normalized value (y) in the imputation, which is log2(CPM + 1), where CPM = (10^6 /sample_count_sum) * rawcount. Imputation takes a knn average of these values. So the exact reverse engineering of rawcount = ( sample_count_sum / 10^6) \*(2^y - 1) won’t be feasible. Since we are taking knn averaging, using rawcounts might not be desirable, [and] we don’t have the rawcount for Merfish data.”

If you need additional information, please reply to this thread.

Topic		Replies	Views
CPM normalization of 10X mouse single-cell RNA-seq Transcriptomics Explorer	2	1383	August 25, 2021
Raw UMI counts as input for MapMyCells	1	62	February 18, 2025
Transcriptomics (RNA-seq/microarray) data normalization - FAQ transcriptomics , tbi	17	6955	April 27, 2021
How can I download data on specific brain regions? transcriptomics	1	215	October 18, 2024
Is smart-seq matrix human multiple cortical areas normalized? Science atlas-cell-types , rna-seq , human	1	496	July 18, 2022

Normalizing imputed spatial dataset

Related topics