scrna6/6

Convert a number of array shards to a single array store#

In the previous notebooks, we’ve seen how to incrementally create a collection and train models on it.

Once we have a collection of validated array shards, we might want to concatenate them to one big array store.

This is what the CELLxGENE team does to create Census: a high number of .h5ad files are concatenated to give rise to a single TileDB-SOMA array store.

This requires duplicating the data that’s present in a collection of .h5ad files, but speeds up ad-hoc queries for slices for arbitrary metadata.

See how this looks for cellxgene here: CELLxGENE: scRNA-seq.