I was writing to ask about how we might obtain a full gene expression dataset for a handful of brain regions from the adult mouse (e.g. the DR and PAG).
We’re interested in getting data by anatomical region as opposed to voxel. Because the ABA interface already permits looking at differentially expressed genes between regions, it would be great if we could obtain the data used as the basis for that tool. My understanding is that the API does have quantified data by structure that we can access.
This information would be incredibly valuable to us. For instance, we’d want to leverage the expression data to refine the way we classify cell types in single-cell RNA sequencing.
A good starting point would be Structure Unionization (see this post). This gets the converse of what you want: all regions for one gene, but does wrap up by region as requested. I’m not sure about getting all gene data for a few regions, but maybe someone else on here knows.
Thank you! In this JSON, is there information about the gene being looked at in each section dataset? Or am I misunderstanding what you’re doing here? Really appreciate the help.
No, as it is written that information is not there, but it could be. Perhaps you can more explicitly describe what it is you need. So far we have identified expression per structure with gene information added. Anything else?
This is very close! The last step would be find out what gene is being assayed in each of the thousands of “section_data_set_ids” (and perhaps other information about each experiment) so that we can look at gene expression in the region, and potentially restrict our analyses to certain experiments.
Adding these additional fields is the equivalent of doing a database “INNER JOIN”. It is likely that not every record will have a corresponding record in another table.
If you are really interested in tracking it down further, I am going to refer you to the API documentation to write queries that return more information that you can use to help your investigation.