Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 14;20(8):e0329122.
doi: 10.1371/journal.pone.0329122. eCollection 2025.

SpaVGN: A hybrid deep learning framework for high-resolution spatial transcriptomics data reconstruction and spatial domain identification

Affiliations

SpaVGN: A hybrid deep learning framework for high-resolution spatial transcriptomics data reconstruction and spatial domain identification

Haiyan Wang et al. PLoS One. .

Abstract

Spatial transcriptomics has revolutionized the analysis of gene expression while preserving tissue spatial information, which provides novel insights into the cellular composition and function of complex biological tissues. However, current technologies are constrained by limited resolution and data sparsity, compromising the accuracy of downstream analyses. To address these challenges, we developed SpaVGN, a deep learning framework integrating convolutional neural networks, vision transformer, and graph neural networks for high-fidelity gene expression imputation and spatial domain identification. By combining local feature extraction, global attention mechanisms, and spatial graph-based modeling, SpaVGN effectively reconstructs missing transcriptomic data while preserving spatial tissue architecture. Evaluated on melanoma and sagittal posterior mouse brain datasets, SpaVGN outperformed existing methods in gene expression prediction, achieving Pearson correlation coefficients of 0.609 (melanoma) and 0.682 (mouse brain). It clearly delineated tumor regions and lymphoid niches in melanoma tissue, achieving fine-grained resolution of hippocampal subfields, including Cornu Ammonis and Dentate Gyrus, with a Silhouette Score of 0.43 and a Davies-Bouldin Index of 0.86. Validation through UMAP dimensionality reduction and PAGA network analysis demonstrated that SpaVGN significantly mitigates the negative impact of data sparsity in spatial transcriptomics, improving data completeness and spatial continuity. This study presents an innovative solution that enhances the resolution of spatial transcriptomics data, offering cross-tissue applicability and providing a valuable tool for research in biological development, disease, and tumor heterogeneity.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Framework for SpaVGN.
Fig 2
Fig 2. Algorithm performance comparison.
(a). Pearson correlation coefficients (PCC) between predicted and true expression patterns for three representative genes (RPS25, TPT1, and MS4A1) across methods. Genes were selected from the top 3 genes with highest median PCC shared by all methods. (b). Violin plots showing gene-wise PCC distributions for all predicted genes in melanoma (left) and mouse brain (right) datasets.
Fig 3
Fig 3. Performance evaluation of SpaVGN on melanoma ST dataset.
(a). Microscopic image of melanoma tissue sections showing three main tissue types: melanoma (black arrows), stroma (red arrows), and lymphoid tissue (green arrows). Scale bar: 100 μm. (b-c). Spatial expression patterns of representative genes (CD37 and DLL3) before and after imputation. Color scale indicates normalized expression levels. (d). Performance comparison of five computational methods in tissue region segmentation. Color-coded regions correspond to different tissue domains. (e). Uniform Manifold Approximation and Projection (UMAP) visualization of cell distributions generated by five different algorithms. (f). Spatial trajectory analysis results from five methods, showing inferred developmental paths between tissue regions. Arrows indicate trajectory directions.
Fig 4
Fig 4. Performance evaluation of SpaVGN on Mouse Brain dataset.
(a). Microscopic image of the tissue section and its annotated regions, showing the distribution of different anatomical areas. (b). Performance comparison of different methods in tissue region segmentation. The left panel shows the Separation Score, and the right panel shows the Davies-Bouldin Score. (c). Visualization of spatial domain identification results by different methods. The clustering results of each method are represented by different colors. (d). UMAP analysis results of different methods. (e). Partition-based Graph Abstraction analysis results of different methods.

Similar articles

References

    1. Rao A, Barkley D, França GS, Yanai I. Exploring tissue architecture using spatial transcriptomics. Nature. 2021;596(7871):211–20. doi: 10.1038/s41586-021-03634-9 - DOI - PMC - PubMed
    1. Vickovic S, Eraslan G, Salmén F, Klughammer J, Stenbeck L, Schapiro D, et al. High-definition spatial transcriptomics for in situ tissue profiling. Nat Methods. 2019;16(10):987–90. doi: 10.1038/s41592-019-0548-y - DOI - PMC - PubMed
    1. Miller BF, Huang FY, Atta L, Sahoo A, Fan J. Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data. Nat Commun. 2022; 13(1). doi: 10.1038/s41467-022-30033-z - DOI - PMC - PubMed
    1. Zhang M, Eichhorn SW, Zingg B, Yao Z, Cotter K, Zeng H, et al. Spatially resolved cell atlas of the mouse primary motor cortex by MERFISH. Nature. 2021;598(7879):137–43. doi: 10.1038/s41586-021-03705-x - DOI - PMC - PubMed
    1. Shah S, Takei Y, Zhou W, Lubeck E, Yun J, Eng C-HL, et al. Dynamics and Spatial Genomics of the Nascent Transcriptome by Intron seqFISH. Cell. 2018;174(2):363-376.e16. doi: 10.1016/j.cell.2018.05.035 - DOI - PMC - PubMed

LinkOut - more resources