Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 1;40(2):btae027.
doi: 10.1093/bioinformatics/btae027.

DESpace: spatially variable gene detection via differential expression testing of spatial clusters

Affiliations

DESpace: spatially variable gene detection via differential expression testing of spatial clusters

Peiying Cai et al. Bioinformatics. .

Abstract

Motivation: Spatially resolved transcriptomics (SRT) enables scientists to investigate spatial context of mRNA abundance, including identifying spatially variable genes (SVGs), i.e. genes whose expression varies across the tissue. Although several methods have been proposed for this task, native SVG tools cannot jointly model biological replicates, or identify the key areas of the tissue affected by spatial variability.

Results: Here, we introduce DESpace, a framework, based on an original application of existing methods, to discover SVGs. In particular, our approach inputs all types of SRT data, summarizes spatial information via spatial clusters, and identifies spatially variable genes by performing differential gene expression testing between clusters. Furthermore, our framework can identify (and test) the main cluster of the tissue affected by spatial variability; this allows scientists to investigate spatial expression changes in specific areas of interest. Additionally, DESpace enables joint modeling of multiple samples (i.e. biological replicates); compared to inference based on individual samples, this approach increases statistical power, and targets SVGs with consistent spatial patterns across replicates. Overall, in our benchmarks, DESpace displays good true positive rates, controls for false positive and false discovery rates, and is computationally efficient.

Availability and implementation: DESpace is freely distributed as a Bioconductor R package at https://bioconductor.org/packages/DESpace.

PubMed Disclaimer

Conflict of interest statement

No competing interest is declared.

Figures

Figure 1.
Figure 1.
Spatial clusters in three samples from LIBD (Maynard et al. 2021) (Visium), melanoma (Thrane et al. 2018) (spatial transcriptomics (Ståhl et al. 2016)) and mouse cerebellum (Cable et al. 2022) (Slide-seqV2 (Stickels et al. 2021)) datasets. Spatial clusters were obtained: via manual annotations form a pathologist (LIBD), BayesSpace (Zhao et al. 2021) (melanoma), and StLearn (Pham et al. 2023) (mouse cerebellum).
Figure 2.
Figure 2.
Three examples of simulated SVGs, from the LIBD data, following bottom/right, circular, and annotations patterns. Examples of SVGs from mixture and inverted mixture patterns are presented in Supplementary Fig. S1.
Figure 3.
Figure 3.
TPR verus FDR for SVG detections in the individual sample simulations. Rows and columns refer to the anchor data used in the simulation, and to the SV profiles, respectively. BayesSpace_DESpace, BayesSpace_findMarkers, and BayesSpace_FindAllMarkers, as well as their counterparts StLearn_DESpace, StLearn_findMarkers, and StLearn_FindAllMarkers, indicate DESpace, scran’s findMarkers, and Seurat’s FindAllMarkers, respectively, based on spatial clusters computed via BayesSpace and StLearn.
Figure 4.
Figure 4.
Expression plot, for four SVGs detected with DESpace individual cluster test, on the real LIBD dataset (sample 151673). SVGs were identifying by selecting high and low expression in white matter (genes MOBP and ENC1, respectively), and high and low abundance in layer 3 (genes HOPX and HS3ST4, respectively). Lines highlight the cluster being tested.
Figure 5.
Figure 5.
TPR versus FDR for SVG detections in the multiple sample simulation. Rows and columns refer to the anchor data used in the simulation, and to the SV profiles, respectively. Note that TPRs are lower than in the individual simulation (Fig. 3), because we have simulated slightly weaker spatial patterns here (see Supplementary Details).
Figure 6.
Figure 6.
Jaccard index on the LIBD and melanoma datasets. For each method, the index is measured on the sets of top SVGs reported across replicates, and represents how coherent results are between samples: higher values indicate greater coherency.

Similar articles

Cited by

References

    1. BinTayyash N, Georgaka S, John S et al. Non-parametric modelling of temporal and spatial counts data from RNA-seq experiments. Bioinformatics 2021;37:3788–95. - PMC - PubMed
    1. Cable DM, Murray E, Zou LS et al. Robust decomposition of cell type mixtures in spatial transcriptomics. Nat Biotechnol 2022;40:517–26. - PMC - PubMed
    1. Chen A, Liao S, Cheng M et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell 2022;185:1777–92.e21. - PubMed
    1. Chen KH, Boettiger AN, Moffitt JR et al. Spatially resolved, highly multiplexed RNA profiling in single cells. Science 2015;348:aaa6090. - PMC - PubMed
    1. Chen Y, Lun AT, Smyth GK. From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline. F1000Res 2016;5:1438. - PMC - PubMed

Publication types