Reading RNA-seq data into python

I’m trying to read in this dataset
https://portal.brain-map.org/atlases-and-data/rnaseq/mouse-whole-cortex-and-hippocampus-10x
with python, but I’m not sure where to start.

Can I use AllenSDK to read in this data? Is there an example code I can use?

Thanks

Hi @tabataba, I don’t have a great answer, other than to say that this is a known issue that has been partially addressed for reading into R in this post. There is another post about reading a different sparse matrix format (tome) into python which probably won’t help, but might have something useful. Maybe someone else has a solution?

Assuming I can read parts of the csv file at a time, is there any tutorial you can recommend that tells me what the other files are for?

Thanks

I would suggest you start with the Readme file listed in the first row of the data download table. It has some information about each of the other files, and in some cases links out to more detailed descriptions. The two critical files are the table of cell metadata and the gene expression matrix – for many use cases the other files are not necessary.