Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep;42(9):1372-1377.
doi: 10.1038/s41587-023-02019-9. Epub 2024 Jan 2.

Inferring super-resolution tissue architecture by integrating spatial transcriptomics with histology

Affiliations

Inferring super-resolution tissue architecture by integrating spatial transcriptomics with histology

Daiwei Zhang et al. Nat Biotechnol. 2024 Sep.

Abstract

Spatial transcriptomics (ST) has demonstrated enormous potential for generating intricate molecular maps of cells within tissues. Here we present iStar, a method based on hierarchical image feature extraction that integrates ST data and high-resolution histology images to predict spatial gene expression with super-resolution. Our method enhances gene expression resolution to near-single-cell levels in ST and enables gene expression prediction in tissue sections where only histology images are available.

PubMed Disclaimer

Figures

Extended Data Fig. 1.
Extended Data Fig. 1.. Correlation between super-resolution gene expression and histology.
Comparison of iStar’s super-resolution gene expression patterns with the paired histology image in the Xenium-derived pseudo-Visium data. Spot boundaries are highlighted.
Extended Data Fig. 2.
Extended Data Fig. 2.. Resolution enhancement at various scales.
Visualization of predicted super-resolution gene expressions by iStar at various scales of resolution enhancement for three breast cancer-related genes (ESR1, ERBB2, and PGR) in the Xenium-derived pseudo-Visium data.
Extended Data Fig. 3.
Extended Data Fig. 3.. Numerical evaluation of prediction accuracy.
Prediction accuracy of iStar and XFuse as measured by root mean squared error (RMSE) and structural similarity index measure (SSIM) for all the 313 genes in the Xenium-derived pseudo-Visium data obtained from a breast cancer patient. In each scatter plot, a dot represents a gene. In this analysis, Section 1 was first treated as the training sample in which in-sample prediction was perfromed for Section 1 and then out-of-sample prediction was perfromed for Section 2. We then repeated this analysis by treating Section 2 as the ‘in-sample’ and Section 1 as the ‘out-of-sample’. The evaluation metrics in’Average’ is the average of those in ‘Sections 1’ and ‘Section 2’.
Extended Data Fig. 4.
Extended Data Fig. 4.. Visualization of single-cell level gene expression prediction.
Visualization of single-cell level gene expression predicted by iStar for the pseudo-Visium breast cancer data derived from Xenium data. Shown on the left is the ground truth single-cell level gene expression directly measured by Xenium, and shown on the right is the single-cell level gene expression predicted by iStar. For each gene, the root mean squared error (RMSE) and Pearson’s correlation coefficient (PCC) between the prediction and the ground truth across the whole tissue and within the shown region are displayed.
Extended Data Fig. 5.
Extended Data Fig. 5.. Prediction accuracy evaluation of single-cell level gene expression prediction.
Prediction accuracy of iStar and XFuse for single-cell level gene expression prediction as measured by root mean squared error (RMSE) for all the 313 genes in the Xenium-derived pseudo-Visium data obtained from a breast cancer patient. In each scatter plot, a dot represents a gene. In this analysis, Section 1 was first treated as the training sample in which in-sample prediction was performed for Section 1 and then out-of-sample prediction was performed for Section 2. We then repeated this analysis by treating Section 2 as the ‘in-sample’ and Section 1 as the ‘out-of-sample’. The evaluation metrics in’Average’ is the average of those in ‘Sections 1’ and ‘Section 2’. We stratified cells by the quantiles of their cell size.
Extended Data Fig. 6.
Extended Data Fig. 6.. Super-resolution vs spot-level signature scoring of tertiary lymphoid structures (TLSs).
iStar detected tertiary lymphoid structures (TLSs) in the Anderson et al. (2021) HER2+ breast cancer dataset. Displayed are the predicted TLS scores by iStar and the original publication, along with the pathologist’s manual annotation reported in the original publication. Super-resolution was performed with 128x resolution enhancement.
Extended Data Fig. 7.
Extended Data Fig. 7.. Super-resolution gene expression prediction in the Xenium-derived pseudo-Visium mouse brain data.
Visualization of the spot-level training data, ground truth gene expression, and predicted super-resolution gene expressions by iStar and XFuse for 24 highly variable genes, whose variances are in the 80%−100% quantiles among all the 248 genes in the Xenium-derived pseudo-Visium data obtained from mouse brain. The variance quantiles of the genes in the order of top-left, top-right, bottom-left, bottom-right are equally spaced from 100% to 80% (in descending order). Super-resolution gene expressions are visualized at the scale of 8x resolution enhancement.
Extended Data Fig. 8.
Extended Data Fig. 8.. Prediction accuracy evaluation and gene-based segmentation comparison in the Xenium-derived pseudo-Visium mouse brain data.
a. Prediction accuracy of by iStar and XFuse as measured by root mean squared error (RMSE) and structural similarity index measure (SSIM) for all the 248 genes in the Xenium-derived pseudo-Visium data obtained from mouse brain. In each scatter plot, a dot represents one gene. b. Segmentation of the Xenium-derived pseudo-Visium data obtained from mouse brain by iStar and XFuse using all 248 genes available in this dataset. Super-resolution was performed with 128x resolution enhancement.
Extended Data Fig. 9.
Extended Data Fig. 9.. Gene-based segmentation of mouse brain datasets.
Analyses of a. mouse brain (coronal cut), b. mouse brain posterior (sagittal cut), and c. mouse brain ofactory bulb Visium datasets generated by 10x Genomics. For each dataset, iStar was applied to enhance the resolution of the top 1000 most highly variable genes. Super-resolution was performed with 128x resolution enhancement. Segmentations by iStar identified fine-grained tissue structures in the mouse brain and agreed with the Allen Brain Atlas annotations.
Extended Data Fig. 10.
Extended Data Fig. 10.. Gene-based tissue segmentation and tertiary lymphoid structure (TLS) signature scoring of cancer datasets and a mouse kidney dataset.
Comparison of gene-based segmentation by iStar with manual tissue annotation in a. mouse kidney and b. prostate cancer Visium datasets generated by 10x Genomics. c. Gene-based segmentation by iStar in a colorectal cancer Visium dataset by 10x Genomics. d. Comparison of tertiary lymphoid structure (TLS) signature score by iStar with manual TLS annotation in a kidney cancer Visium dataset generated by Meylan et al. (2022).
Fig. 1 |
Fig. 1 |. Workflow and super-resolution gene expression prediction accuracy of iStar.
a, Model summary of iStar. The histology image is hierarchically divided into tiles, which are then converted into hierarchical histology image features. These features, combined with the spot-level gene expression data, are then utilized to predict super-resolution gene expression. Finally, tissue architecture is inferred based on the super-resolved gene expression prediction. b, c, Evaluation of super-resolution gene expression prediction accuracy using the Xenium breast cancer dataset, which includes two consecutively cut tissue sections. The Xenium data served as the ground truth and were used to simulate spot-level gene expression based on the spot size and layout of Visium. For in-sample prediction, both model training and prediction were performed using section 2’s pseudo-Visium data. For out-of-sample prediction, section 1 was used for model training and section 2 was used for prediction in which only its histology image was used as input. b, Visual comparison between iStar and XFuse. ERBB2, ESR1, and PGR are genes that encode biomarkers for breast cancer prognosis, while MS4A1 is a B cell marker gene. Super-resolution gene expressions are visualized at the scale of 8x resolution enhancement. Visualizations of additional genes at the scale of 8x resolution enhancement are presented in Supplementary Figs. 1–5. Visualizations at other resolutions are available in Supplementary Fig. 6 and at https://upenn.box.com/v/istar-results-benchmark. c, Numerical comparison between iStar and XFuse for 128x resolution enhancement of gene expression. The degree of resolution enhancement is defined as the number of superpixels in the super-resolution prediction divided by the number of spots in the training data. Each dot represents one of the 313 genes. Additional numerical evaluations are reported in Supplementary Fig. 7. d, Predicted single cell-level gene expression, which was computed from the predicted superpixel-level gene expression using the cell segmentation masks provided in the dataset. Additional examples are presented in Supplementary Fig. 9.
Fig. 2 |
Fig. 2 |. Tissue annotation using iStar.
a, Comparison of unsupervised tissue segmentation by iStar and XFuse with manual annotation of one of the two consecutively cut tissue sections of a breast cancer patient in the Xenium dataset. The model was trained using the pseudo-Visium spot-level gene expression simulated from Section 1 (in-sample) of the Xenium data. Section 2 was treated as the out-of-sample section, and its super-resolution gene expression was predicted only using its histology image. Super-resolution was performed with 128x resolution enhancement. b, Tissue architecture annotation of a breast cancer tissue in the HER2ST breast cancer dataset (three consecutively cut tissue sections in Subject H). Super-resolution was performed with 128x resolution enhancement. c, iStar assigned biologically meaningful labels to the tissue clusters by performing superpixel-level cell type inference, followed by a cell type enrichment analysis, where depletion, i.e., negative enrichment, was not shown in the heatmap. d, iStar’s unsupervised tissue segmentation revealed intraltumoral heterogeneity that agreed with the pathologist’s manual annotation. e, A small cancer region detected by iStar that was missed in manual annotation provided in the original publication. f, Detection of Tertiary Lymphoid Structures (TLS) in the HER2ST breast cancer dataset (Subject G) by iStar. The TLS score was calculated as the mean of the standardized super-resolution gene expressions of the TLS marker genes shown in Supplementary Table 1.

References

    1. Burgess DJ Spatial transcriptomics coming of age. Nat. Rev. Genet 20, 317 (2019). - PubMed
    1. Asp M, Bergenstrahle J. & Lundeberg J. Spatially Resolved Transcriptomes-Next Generation Tools for Tissue Exploration. Bioessays 42, e1900221 (2020). - PubMed
    1. Crosetto N, Bienko M. & van Oudenaarden A. Spatially resolved transcriptomics and beyond. Nat. Rev. Genet 16, 57–66 (2015). - PubMed
    1. Moor AE & Itzkovitz S. Spatial transcriptomics: paving the way for tissue-level systems biology. Curr. Opin. Biotechnol 46, 126–133 (2017). - PubMed
    1. Hu J. et al. SpaGCN: Integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nat. Methods 18, 1342–1351 (2021). - PubMed

Methods-only References

    1. Steiner A. et al. How to train your vit? data, augmentation, and regularization in vision transformers. Preprint at arXiv, 10.48550/arXiv.2106.10270 (2021). - DOI
    1. Hölscher DL et al. Next-Generation Morphometry for pathomics-data mining in histopathology. Nat. Commun 14, 470 (2023). - PMC - PubMed
    1. Rappez L. et al. SpaceM reveals metabolic states of single cells. Nat. Methods 18, 799–805 (2021). - PMC - PubMed
    1. Xu B, Wang N, Chen T. & Li M. Empirical evaluation of rectified activations in convolutional network. Preprint arXiv, 10.48550/arXiv.1505.00853 (2015). - DOI
    1. Clevert D-A, Unterthiner T. & Hochreiter S. Fast and accurate deep network learning by exponential linear units (ELUs). Preprint at arXiv, 10.48550/arXiv.1511.07289 (2015). - DOI

LinkOut - more resources