Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 24:12:RP89780.
doi: 10.7554/eLife.89780.

Measures of genetic diversification in somatic tissues at bulk and single-cell resolution

Affiliations

Measures of genetic diversification in somatic tissues at bulk and single-cell resolution

Marius E Moeller et al. Elife. .

Abstract

Intra-tissue genetic heterogeneity is universal to both healthy and cancerous tissues. It emerges from the stochastic accumulation of somatic mutations throughout development and homeostasis. By combining population genetics theory and genomic information, genetic heterogeneity can be exploited to infer tissue organization and dynamics in vivo. However, many basic quantities, for example the dynamics of tissue-specific stem cells remain difficult to quantify precisely. Here, we show that single-cell and bulk sequencing data inform on different aspects of the underlying stochastic processes. Bulk-derived variant allele frequency spectra (VAF) show transitions from growing to constant stem cell populations with age in samples of healthy esophagus epithelium. Single-cell mutational burden distributions allow a sample size independent measure of mutation and proliferation rates. Mutation rates in adult hematopietic stem cells are higher compared to inferences during development, suggesting additional proliferation-independent effects. Furthermore, single-cell derived VAF spectra contain information on the number of tissue-specific stem cells. In hematopiesis, we find approximately 2 × 105 HSCs, if all stem cells divide symmetrically. However, the single-cell mutational burden distribution is over-dispersed compared to a model of Poisson distributed random mutations. A time-associated model of mutation accumulation with a constant rate alone cannot generate such a pattern. At least one additional source of stochasticity would be needed. Possible candidates for these processes may be occasional bursts of stem cell divisions, potentially in response to injury, or non-constant mutation rates either through environmental exposures or cell-intrinsic variation.

Keywords: evolutionary biology; evolutionary inferences; healthy human tissues; human; sampling; single-cell mutation burden; stem cell dynamics; varient allele frequency.

PubMed Disclaimer

Conflict of interest statement

MM, NM, BW, WH No competing interests declared

Figures

Figure 1.
Figure 1.. The distribution of variant allele frequencies changes with the growth phases and by sampling.
(a) In the current population, cells divide symmetrically into two daughter cells or asymmetrically with only one daughter cell kept in the focused population. All other events are mathematically equivalent to and are treated as a part of cell death. (b) The rates of symmetric and asymmetric division change during the population growth and lead to a dynamic distribution of variant allele frequencies. (c) The observed VAF distribution is shifted again during sampling compared to the VAF of the whole population – a fact should be considered when inferring population properties through genetic data.
Figure 2.
Figure 2.. Bulk sequencing based variant allele frequency (VAF) and mutation rate inferences in healthy esophagus.
(a) Expected VAF distributions from evolving Equation 1 to different time points for a population with an initial exponential growth phase and subsequent constant population phase (mature size N=103). Once the population reaches the maximum carrying capacity, the distribution moves from a 1/f2 growing population shape (purple) to a 1/f constant population shape (green). Note that the shift slows considerably at older age. (b) VAF from healthy tissue in the esophagus of nine individuals sorted into age brackets. The youngest bracket, 20–39, is closer to the developmental 1/f2 scaling. The older age brackets are both close to the constant population 1/f scaling, resembling the theoretical expectations. (c) Inferred mutation rates increase linearly with age. (d) Simulations of slowly growing stem cell populations reveal that mutation rates appear to increase with age, although the true underlying per division mutation rate remaining constant (see Figure 2—figure supplement 1 as well).
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. Comparison of an exponential and logistic growth model.
(a) Population size over time for exponential (dotted) and logistic (dashed) growth functions. At the point of maturity (t=20) the exponential function reaches the population capacity (NM=10000), and the logistic function equals 0.99NM. The time points at which the VAF spectra are measured are indicated by solid vertical lines: t1=25 (blue) and t2=50 (orange). (b) Comparison of the expected variant allele frequency (VAF) spectra, calculated with Equation 1, in the exponential (dotted) and logistic (dashed) growth models measured at the time points t1 and t2. For reference, the theoretical predictions for an exponentially growing population without cell death and a constant population in the long time limit are shown as solid purple and green lines, respectively.
Figure 3.
Figure 3.. Inference of evolutionary parameters on simulated stem cell populations.
Simulated populations were run up to age 59, growing exponentially from a single-cell to constant size NM=10000 at age tM=5, with mutation rate μ=1.2 and division rates λ=5 and p=0.4. Where sampling is mentioned, the sample size 89 was taken. (a) The single-cell mutational burden distribution. The compound Poisson distribution (dashed line) matches the burden distribution when averaging over multiple independently evolved populations (filled curve). (b) Distribution of estimated mutation rates from 10’000 individual simulations, obtained from burden distributions of the complete populations (blue) as well as sampled sets of cells (orange). Because the expected mutational burden distribution is unaltered by sampling, the expected estimate of the mutation rate from Equation 5 remains unchanged: E(μ~pop)=E(μ~sample). However, sampling increases the noise on the observed burden distribution, which results in a higher error margin of the estimate: σ(μ~pop)<σ(μ~sample). (c) VAF spectra measured in the complete population (blue) and a sampled set of cells (orange). In contrast with the mutational burden distribution, strong sampling changes the shape of the expected distribution. A single simulation result is shown (diamonds) alongside the theoretically predicted expected values for both the total and sampled populations (Equations 1 and 12) (dashed line) and the average across 100 simulations (solid line). (d) Distribution of NM and p inference results for 100 simulated and sampled populations, through estimation of μ~ and λ~ from the single-cell burden distribution and fitting the number of lowest frequency (1/S) mutations to the theoretical prediction in Equation 1 (see Figure 3—figure supplements 1–3 as well).
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Stochastic simulations of the Variance/Mean of the mutational burden distribution over time for a per cell division mutation rate of μ=1.3 and varying stem cell population size N and asymmetric division probability p.
Stochastic fluctuations are pronounced for small population size N and low asymmetric division probability p.
Figure 3—figure supplement 2.
Figure 3—figure supplement 2.. Likelihood of the Variance/Mean to be in the interval 3<θ<5 for a per division mutation rate of μ=1.3.
Figure 3—figure supplement 3.
Figure 3—figure supplement 3.. Likelihood of the Variance/Mean to be in the interval 3<θ<5 for a per division mutation rate of μ=3.
Figure 4.
Figure 4.. Evolutionary inferences in single-cell hematopoietic stem cell (HSC) data.
(a) The single-cell mutational burden distribution of the data (bars) and the compound Poisson distribution obtained from its mean and variance, used to obtain the estimated per division mutation rate μ~. (b) Distribution of mutation frequencies of the data and theoretically predicted average fitted to only the lowest frequency (1/S) data point. (c) Difference Δvf between the measured value of the VAF spectrum at the lowest frequency (1/S) and its prediction from Equation 1, for varying total population size N and asymmetric division proportion p, with fixed maturation time tM=5 and operational hematopoietic population size NH=50. The solid line denotes the plane of best fit where this difference is 0. (d) Maximally inferred population size N (taking p=0 in (c)) for variation of the maturation time tN and the operational hematopoietic population size NH (see Figure 4—figure supplement 1 as well).
Figure 4—figure supplement 1.
Figure 4—figure supplement 1.. The standard deviation on the variant allele frequency (VAF) spectrum increases for higher frequencies.
(a) The VAF spectrum Vf averaged across 100 simulations of a population evolved according to the model described in section 1.1. The standard deviation from the mean is shown as the band Vf±σf around the average. (b) The standard deviation across all simulations for each frequency f=1/N,2/N,, scaled by the average spectrum Vf.

Update of

  • doi: 10.1101/2022.11.07.515470
  • doi: 10.7554/eLife.89780.1
  • doi: 10.7554/eLife.89780.2

Comment in

  • doi: 10.7554/eLife.95513

Similar articles

  • Capturing the unpredictability of stem cells.
    Durmaz A, Visconte V. Durmaz A, et al. Elife. 2024 Mar 1;13:e95513. doi: 10.7554/eLife.95513. Elife. 2024. PMID: 38427029 Free PMC article.
  • Tissue-specific mutation accumulation in human adult stem cells during life.
    Blokzijl F, de Ligt J, Jager M, Sasselli V, Roerink S, Sasaki N, Huch M, Boymans S, Kuijk E, Prins P, Nijman IJ, Martincorena I, Mokry M, Wiegerinck CL, Middendorp S, Sato T, Schwank G, Nieuwenhuis EE, Verstegen MM, van der Laan LJ, de Jonge J, IJzermans JN, Vries RG, van de Wetering M, Stratton MR, Clevers H, Cuppen E, van Boxtel R. Blokzijl F, et al. Nature. 2016 Oct 13;538(7624):260-264. doi: 10.1038/nature19768. Epub 2016 Oct 3. Nature. 2016. PMID: 27698416 Free PMC article.
  • Somatic mutation landscapes at single-molecule resolution.
    Abascal F, Harvey LMR, Mitchell E, Lawson ARJ, Lensing SV, Ellis P, Russell AJC, Alcantara RE, Baez-Ortega A, Wang Y, Kwa EJ, Lee-Six H, Cagan A, Coorens THH, Chapman MS, Olafsson S, Leonard S, Jones D, Machado HE, Davies M, Øbro NF, Mahubani KT, Allinson K, Gerstung M, Saeb-Parsy K, Kent DG, Laurenti E, Stratton MR, Rahbari R, Campbell PJ, Osborne RJ, Martincorena I. Abascal F, et al. Nature. 2021 May;593(7859):405-410. doi: 10.1038/s41586-021-03477-4. Epub 2021 Apr 28. Nature. 2021. PMID: 33911282
  • A high-fidelity method for genomic sequencing of single somatic cells reveals a very high mutational burden.
    Vijg J, Dong X, Zhang L. Vijg J, et al. Exp Biol Med (Maywood). 2017 Jul;242(13):1318-1324. doi: 10.1177/1535370217717696. Exp Biol Med (Maywood). 2017. PMID: 28737476 Free PMC article. Review.
  • Strategies for homeostatic stem cell self-renewal in adult tissues.
    Simons BD, Clevers H. Simons BD, et al. Cell. 2011 Jun 10;145(6):851-62. doi: 10.1016/j.cell.2011.05.033. Cell. 2011. PMID: 21663791 Review.

Cited by

References

    1. Abascal F, Harvey LMR, Mitchell E, Lawson ARJ, Lensing SV, Ellis P, Russell AJC, Alcantara RE, Baez-Ortega A, Wang Y, Kwa EJ, Lee-Six H, Cagan A, Coorens THH, Chapman MS, Olafsson S, Leonard S, Jones D, Machado HE, Davies M, Øbro NF, Mahubani KT, Allinson K, Gerstung M, Saeb-Parsy K, Kent DG, Laurenti E, Stratton MR, Rahbari R, Campbell PJ, Osborne RJ, Martincorena I. Somatic mutation landscapes at single-molecule resolution. Nature. 2021;593:405–410. doi: 10.1038/s41586-021-03477-4. - DOI - PubMed
    1. Bailey C, Black JRM, Reading JL, Litchfield K, Turajlic S, McGranahan N, Jamal-Hanjani M, Swanton C. Tracking cancer evolution through the disease course. Cancer Discovery. 2021;11:916–932. doi: 10.1158/2159-8290.CD-20-1559. - DOI - PMC - PubMed
    1. Black JRM, McGranahan N. Genetic and non-genetic clonal diversity in cancer evolution. Nature Reviews Cancer. 2021;21:379–392. doi: 10.1038/s41568-021-00336-2. - DOI - PubMed
    1. Cagan A, Baez-Ortega A, Brzozowska N, Abascal F, Coorens THH, Sanders MA, Lawson ARJ, Harvey LMR, Bhosle S, Jones D, Alcantara RE, Butler TM, Hooks Y, Roberts K, Anderson E, Lunn S, Flach E, Spiro S, Januszczak I, Wrigglesworth E, Jenkins H, Dallas T, Masters N, Perkins MW, Deaville R, Druce M, Bogeska R, Milsom MD, Neumann B, Gorman F, Constantino-Casas F, Peachey L, Bochynska D, Smith ESJ, Gerstung M, Campbell PJ, Murchison EP, Stratton MR, Martincorena I. Somatic mutation rates scale with lifespan across mammals. Nature. 2022;604:517–524. doi: 10.1038/s41586-022-04618-z. - DOI - PMC - PubMed
    1. Caravagna G, Heide T, Williams MJ, Zapata L, Nichol D, Chkhaidze K, Cross W, Cresswell GD, Werner B, Acar A, Chesler L, Barnes CP, Sanguinetti G, Graham TA, Sottoriva A. Subclonal reconstruction of tumors by using machine learning and population genetics. Nature Genetics. 2020;52:898–907. doi: 10.1038/s41588-020-0675-5. - DOI - PMC - PubMed