I was wondering if there is a way to get the full post-clustering gene expression vs. cell type that is displayed here for the mouse MOp? Brain Knowledge Platform
Or do I need to take the raw data provided here (Main page) and cluster it myself?
If so, any tips for making sure my clusters match the BICCN MOp clusters?
Basically I am hoping to get the full list of marker genes for each cell subtype in the Allen MOp taxonomy
Edit:
Can the new MapMyCells service help with this? Or Azimuth?
Thanks for your question! You will need to get the cell x gene data from NeMO, as you mention, but the marker genes and annotations you’re looking for are available in this repo, although spread across a few different tables:
All NS-Forest Marker genes for each cell type in the taxonomy as displayed on the Cell Type Knowledge Explorer, mapped to taxonomy Accession IDs
Azimuth does host the MOp taxonomy so could be useful in your work, however MapMyCells currently does not. MapMyCells does host the whole mouse brain taxonomy, which you may also find useful if you’re looking for broader cell type annotations.
Thanks so much Ray! Would you say one approach may be as follows then:
Download the cell x gene data from NeMO, then for each cell in the cell x gene data (tagged via it’s cell barcode), I use the ‘cell to cell set assignments’ to find it’s Accession ID in the taxonomy, then use the ‘Taxonomy nomenclature table’ to find it’s cell type alias?
And then sorry not sure if you know this, but was all of this data used in determining the final published taxonomy? Just preliminarily searching through it only a subset of cells were included, possibly filtered through some QC mechanism
Okay I think I have an understanding of how to do it after doing some thinking and reading through the datasets more; will let you know if i have any issues, thank you for all the help!
EDIT:
I solved it by just using the features.tsv from another 10X data set; it has the same genes in the correct order, but also all the necessary columns for Read10X to work, so it worked now.