Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep;18(9):e11080.
doi: 10.15252/msb.202211080.

Vesalius: high-resolution in silico anatomization of spatial transcriptomic data using image analysis

Affiliations

Vesalius: high-resolution in silico anatomization of spatial transcriptomic data using image analysis

Patrick C N Martin et al. Mol Syst Biol. 2022 Sep.

Abstract

Characterization of tissue architecture promises to deliver insights into development, cell communication, and disease. In silico spatial domain retrieval methods have been developed for spatial transcriptomics (ST) data assuming transcriptional similarity of neighboring barcodes. However, domain retrieval approaches with this assumption cannot work in complex tissues composed of multiple cell types. This task becomes especially challenging in cellular resolution ST methods. We developed Vesalius to decipher tissue anatomy from ST data by applying image processing technology. Vesalius uniquely detected territories composed of multiple cell types and successfully recovered tissue structures in high-resolution ST data including in mouse brain, embryo, liver, and colon. Utilizing this tissue architecture, Vesalius identified tissue morphology-specific gene expression and regional specific gene expression changes for astrocytes, interneuron, oligodendrocytes, and entorhinal cells in the mouse brain.

Keywords: anatomical territories; spatial domains; spatial transcriptomics; tissue architecture; tissue heterogeneity.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Describing Tissue territories with Vesalius
  1. A

    Vesalius embeds ST data into RGB‐colored images. This is achieved by preprocessing ST data and reducing dimensionality. In parallel, punctual ST coordinates are converted into tiles. Finally, the UMAP latent space (or PCA loading values) is transformed into an RGB color space, and the color code attributed to each barcode is assigned to its respective tile.

  2. B

    Vesalius applies image analysis techniques to RGB images describing the transcriptional landscape of a tissue with the aim of isolating tissue territories.

  3. C

    Vesalius enables a territory‐based ST framework including spatial territory clustering, territory comparison, tissue border expression, and morphology‐driven expression.

Figure 2
Figure 2. Vesalius outperforms other Spatial domain tools in heterogenous territories
  1. A

    Adjusted Rand Index score between Vesalius, BayesSpace, Giotto, Seurat, STAGATE, SpaGCN, and SEDR. Overall, Vesalius outperforms other tools in retrieving spatial domains in high‐resolution simulated data sets (Kruskal–Wallis and star notation using Wilcoxon rank‐sum test for the multiple comparisons with n = 10 simulation replicates).

  2. B

    The high performance of Vesalius compared with competitors is also highlighted using the variation of information score, which is more robust against cluster granularity. Boxplot shows VI scores over 10 simulation replicates (Kruskal–Wallis and star notation using Wilcoxon rank‐sum test for the multiple comparisons with n = 10 simulation replicates).

  3. C

    Vesalius has the lowest run time among spatial domain tools with a run time 3–20 times lower than competing tools (Kruskal–Wallis and star notation using Wilcoxon rank‐sum test for the multiple comparisons with n = 80 simulation replicates).

  4. D

    Example of simulated ground truth and prediction by each tool in four simulated regimes (Dotted, Exponential, Pure, and Uniform). The label above each plot describes the simulation run in the following format Tool–Regime–Number of territories–Number of cells–Simulation replicate. For example, Vesalius Uni 3 3 1 describes Vesalius's prediction in the uniform regime, which contains three territories each with three cell types in replicate 1.

Data information: Box plots show five summary statistics: the median, two hinges representing the interquartile range (IQR) and two whiskers representing 1.5 times the IQR from the median. All dots are considered as outliers from this distribution.
Figure 3
Figure 3. Vesalius recovers uniform anatomical territories in high‐resolution ST data
  1. A

    Vesalius accurately recovers tissue territories in Slide‐seqV2 data taken from the mouse hippocampus and surrounding brain (Puck_200115_08). A comparison with the Allen Brain Atlas reference atlas illustrates that Vesalius recovers many structures such as the dentate gyrus, corpus callosum, and the CA field.

  2. B

    BayesSpace and Seurat applied to the same data set recover structures insofar as these structures contain homogenous cell populations. The identified clusters are dispersed over the entire tissue section and thus do not represent a clear tissue territory.

  3. C

    Vesalius recovered uniform tissue territories in the mouse embryo (Slide‐seqV2—Puck_190926_03). The microscopy image highlights the section of the embryo used to produce Puck_190926_03 (image taken from Slide‐seq V2; Stickels et al, 2021).

  4. D

    BayesSpace and Seurat recover structures insofar as these structures contain homogenous cell populations. The identified clusters are dispersed over the entire tissue section and thus do not represent a clear tissue territory.

  5. E

    Vesalius identified tissue territory using Seq‐Scope early‐onset liver failure (Sample 2117). Vesalius highlights various hepatocyte populations such as pericentral hepatocytes (Hep. PC), periportal hepatocytes (Hep. PP), and injured hepatocytes (Injured Hep.).

  6. F

    Vesalius identified tissue territory using Seq‐Scope in healthy colon (Sample 2111). Territory isolation in the colon shows various structures and layers including the smooth muscle, the crypt surface, and the crypt base.

  7. G

    Vesalius recovers tissue territories in seqFISH data in mouse embryo (right) and can distinguish between different brain regions compared with the single‐cell annotation (left).

  8. H–J

    In low‐resolution Visium 10X data sets, Vesalius accurately recovers tissue territories in a broad range of tissue such as mouse brain (H), mouse kidney (I), and human breast cancer (J).

Figure 4
Figure 4. In‐depth analysis of isolated territories reveals the finer details of spatial patterning
  1. A

    Mapping of clustered barcodes in the isolated CA field. Vesalius recovers all 3 CA pyramidal layers.

  2. B

    CA2 pyramidal layer was enriched with, among others, Pcp4, a canonical CA2 layer marker. The ISH image taken from the Allen Brain Atlas corroborates the positioning of the CA2 layer within the isolated CA field.

  3. C

    Pcp4 expression within the CA2 layer is lost in favor of stronger expression in the thalamus.

  4. D

    The isolated medial habenula and third ventricle show distinct spatial compartments after barcode clustering.

  5. E

    ISH image describing a medial habenula lower compartment marker (Gabbr2) and a medial habenula upper compartment marker (Calb2). The medial habenula is highlighted in red.

  6. F

    ISH images of lower third ventricle marker (Nnat) and upper third ventricle marker (Enpp2).

  7. G

    Barcode clustering and mapping of the embryonic eye (E12.5) show that Anterior Lens Epithelial cells and Lens Vesicle cells are separated into distinct layers.

  8. H

    Differential gene expression analysis between Anterior Lens Epithelial Cell layers revealed that the expression of Cryba4 was restricted to the inner layer while Cnnd2 was expressed in the outer layer. Similarly, Pmel was expressed in the outer Lens Vesicle cell layer and Aldh1a1 was expressed in the inner layer.

Figure 5
Figure 5. Investigation of territories reveals gene expression patterns linked to neighboring tissues and tissue morphology
  1. A

    Differential gene expression analysis between cells contained in the cortex and the thalamus shows that spatial location influences gene expression. Astrocytes in the cortex are enriched with Cpe, while entorhinal cells in the thalamus are enriched with Nrgn.

  2. B

    Barcode clustering of the isolated dentate gyrus reveals transcriptional dissimilarity between each dentate gyrus (DG) layer.

  3. C

    Differential gene expression analysis between DG Granule cell layer and DG sub‐granular zone displayed a high expression of Cst3 and Apoe at the border between layers. Cst3 and Apoe border expression is corroborated by Allen brain Atlas ISH images.

  4. D

    Layered expression pattern of Stmn4 and Kif5a within the isolated corpus callosum showed a higher expression at the center of the corpus callosum. ISH images corroborate the spatial expression pattern of both genes. Corpus callosum contained within red lines.

Figure EV1
Figure EV1. Vesalius methods visualized
  1. A

    Vesalius converts the transcriptome of cells by using a normalized latent space (either UMAP or PCA) and using embedding values as RGB color values.

  2. B

    After image segmentation, Vesalius further isolates territories by pooling barcodes that are close to each other in 2D space. Vesalius finds all barcodes that are within a capture distance of each other (represented by red circle) and assigns a unique territory to all these barcodes. The same process is applied to all beads of that color segment until all beads have been pooled into a distinct spatial territory.

  3. C

    Simulation regimes used to benchmark Vesalius in high‐resolution ST data. The pure regime only contains one cell type per territory. The uniform regime contains n different cell types in equal proportion in each territory. The exponential regime contains n cell types in varying proportions between territories. The cell types are the same between territories. The dotted regime is made of a background territory with five circular territories of random size. The number of cell types between each territory is also randomized. Circular territories may overlap.

  4. D

    Territory layering uses images representation of territory to iteratively select the edge of a territory and assign a layer value to that edge. Once all barcodes have been assigned to a layer, the number of layers can be reduced by merging neighboring layers.

Figure EV2
Figure EV2. BayesSpace spatial cluster in mouse hippocampus
Visualizing BayesSpace clusters separately emphasizes that many of the spatial domains do not represent a specific anatomical structure. In this context, selecting a specific location in the tissue is impossible without either missing barcodes or by including unwanted barcodes. This also demonstrates why cluster sub‐clustering does not equate to territory isolation and clustering.
Figure EV3
Figure EV3. Seurat clusters in mouse hippocampus
Visualizing Seurat clusters separately emphasizes that many of the clusters do not represent a specific anatomical structure. In this context, selecting a specific location in the tissue is impossible without either missing barcodes or by including unwanted barcodes. This also demonstrates why cluster sub‐clustering does not equate to territory isolation and clustering. Seurat is not designed to recover spatial domains but rather cluster transcriptional similarity.
Figure EV4
Figure EV4. BayesSpace spatial cluster in mouse embryo
Visualizing BayesSpace clusters separately emphasizes that many of the spatial domains do not represent a specific anatomical structure. In this context, selecting a specific location in the tissue is impossible without either missing barcodes or by including unwanted barcodes. This also demonstrates why cluster sub‐clustering does not equate to territory isolation and clustering.
Figure EV5
Figure EV5. Seurat clusters in mouse embryo
Visualizing Seurat clusters separately emphasizes that many of the clusters do not represent a specific anatomical structure. In this context, selecting a specific location in the tissue is impossible without either missing barcodes or by including unwanted barcodes. This also demonstrates why cluster sub‐clustering does not equate to territory isolation and clustering. Seurat is not designed to recover spatial domains but rather cluster transcriptional similarity.

References

    1. Barthelmé S, Tschumperlé D (2019) Imager: an R package for image processing based on CImg. J Open Source Softw 4: 1012
    1. Bergenstråhle J, Larsson L, Lundeberg J (2020) Seamless integration of image and molecular analysis for spatial transcriptomics workflows. BMC Genomics 21: 1–7 - PMC - PubMed
    1. Burgess DJ (2019) Spatial transcriptomics coming of age. Nat Rev Genet 20: 317 - PubMed
    1. Cable DM, Murray E, Zou LS, Goeva A, Macosko EZ, Chen F, Irizarry RA (2021) Robust decomposition of cell type mixtures in spatial transcriptomics. Nat Biotechnol 40: 517–526 - PMC - PubMed
    1. Chen KH, Boettiger AN, Moffitt JR, Wang S, Zhuang X (2015) Spatially resolved, highly multiplexed RNA profiling in single cells. Science 348: aaa6090 - PMC - PubMed

Publication types