Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 11;15(1):4987.
doi: 10.1038/s41467-024-49174-4.

Optimizing the design of spatial genomic studies

Affiliations

Optimizing the design of spatial genomic studies

Andrew Jones et al. Nat Commun. .

Abstract

Spatial genomic technologies characterize the relationship between the structural organization of cells and their cellular state. Despite the availability of various spatial transcriptomic and proteomic profiling platforms, these experiments remain costly and labor-intensive. Traditionally, tissue slicing for spatial sequencing involves parallel axis-aligned sections, often yielding redundant or correlated information. We propose structured batch experimental design, a method that improves the cost efficiency of spatial genomics experiments by profiling tissue slices that are maximally informative, while recognizing the destructive nature of the process. Applied to two spatial genomics studies-one to construct a spatially-resolved genomic atlas of a tissue and another to localize a region of interest in a tissue, such as a tumor-our approach collects more informative samples using fewer slices compared to traditional slicing strategies. This methodology offers a foundation for developing robust and cost-efficient design strategies, allowing spatial genomics studies to be deployed by smaller, resource-constrained labs.

PubMed Disclaimer

Conflict of interest statement

B.E.E. is on the SAB of Creyon Bio, Arrepath, and Freenome. B.E.E. is a consultant with Neumora. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Demonstration of slicing in two-dimensional simulated tissue.
a Simulated spherical tissue with a grid of spots. b An example one-dimensional slice through the tissue. c The resulting observations at each spot after taking the slice in (b). The colors represent a univariate phenotype. d After slicing, the simulated tissue is split into two fragments. e Each line represents a candidate one-dimensional slice. Each slice is colored by its EIG (normalized to have a maximum of one). The slice with the highest EIG is then chosen; see panel (f). f The EIG-maximizing slice from the candidates in panel (e). g Tissue fragments after T = 10 iterations of repeated slicing. Each color represents a distinct fragment. Source data are provided as a Source Data file.
Fig. 2
Fig. 2. Imputing unobserved gene expression from observed cross-sections.
a Simulated tissue colored by synthetic gene expression. b An example slice through the synthetic tissue. c The resulting observations from the slice in (b). d R2 for gene expression imputation after each slicing iteration for each method. Error bands represent 95% confidence intervals computed using n = 5 runs. Source data are provided as a Source Data file.
Fig. 3
Fig. 3. Synthetic slicing experiment for localizing a region of interest.
a Two-dimensional simulated spatial gene expression data with a region of interest in orange (ROI). b Point-wise observations. Orange points are labeled as belonging to the ROI, blue points are outside the ROI, and gray points are unobserved. c Estimated expected information gain (EIG) for each spatial location, where each design is a single point. d Estimated EIG for each horizontal slice design. e Synthetic ROI data. f Slices chosen after T = 5 iterations of running our model. g Mean F1 score of predictions after each iteration. Error bars represent 95% confidence intervals computed using n = 5 runs. Source data are provided as a Source Data file.
Fig. 4
Fig. 4. Application to Visium data.
a Spatial locations of tissue. b Slices chosen by each approach after T = 5 iterations. The outline of the tissue is shown by the solid black line, and the slices chosen by each approach are shown by the dashed lines. The color legend is in panel (c). The full lines are drawn for clarity, but only the nonintersecting piece (within one tissue fragment) is considered as the relevant slice. c Predictive R2 of the held-out gene expression for both approaches across iterations. Error bands represent 95% confidence intervals computed using n = 5 runs. Source data are provided as a Source Data file.
Fig. 5
Fig. 5. Reconstructing the Allen Brain Atlas.
a Allen Brain Atlas coordinates colored by the expression of PCP4. b An example slice through the coordinates. c The resulting observations after taking this slice. d The slices and observations chosen by the EIG approach. e Imputation performance across experimental iterations. Error bars represent 95% confidence intervals computed using n = 5 runs. Source data are provided as a Source Data file.
Fig. 6
Fig. 6. Localizing invasive carcinoma in prostate tissue.
a Slices chosen by the EIG method. Cancerous spots are shown in red. The full lines are drawn for clarity, but only the nonintersecting piece (within one tissue fragment) is considered as the relevant slice. b F1 score of tumor/healthy label predictions after each iteration of experimental design. Error bars represent 95% confidence intervals computed using n = 5 runs. c Tumor/healthy predictions following five iterations of design. Stronger yellow color indicates spots with higher predicted probability of containing tumorous tissue. Source data are provided as a Source Data file.

Update of

References

    1. Moses L, Pachter L. Museum of spatial transcriptomics. Nat. Methods. 2022;19:534–546. doi: 10.1038/s41592-022-01409-2. - DOI - PubMed
    1. Lein ES, et al. Genome-wide atlas of gene expression in the adult mouse brain. Nature. 2007;445:168–176. doi: 10.1038/nature05453. - DOI - PubMed
    1. Zhang M, et al. Spatially resolved cell atlas of the mouse primary motor cortex by MERFISH. Nature. 2021;598:137–143. doi: 10.1038/s41586-021-03705-x. - DOI - PMC - PubMed
    1. Saviano A, Henderson NC, Baumert TF. Single-cell genomics and spatial transcriptomics: Discovery of novel cell states and cellular interactions in liver physiology and disease biology. J. Hepatol. 2020;73:1219–1230. doi: 10.1016/j.jhep.2020.06.004. - DOI - PMC - PubMed
    1. Mantri M, et al. Spatiotemporal single-cell RNA sequencing of developing chicken hearts identifies interplay between cellular differentiation and morphogenesis. Nat. Commun. 2021;12:1–13. doi: 10.1038/s41467-021-21892-z. - DOI - PMC - PubMed