The impact of genetic modifiers on variation in germline mutation rates within and among human populations

William R Milligan¹, Guy Amster^{1

2}, Guy Sella^{1

3}

Affiliations

¹ Department of Biological Sciences, Columbia University, New York, NY 10027, USA.
² Flatiron Health Inc., New York, NY 10013, USA.
³ Program for Mathematical Genomics, Columbia University, New York, NY 10032, USA.

PMID: 35666194
PMCID: PMC9339295
DOI: 10.1093/genetics/iyac087

The impact of genetic modifiers on variation in germline mutation rates within and among human populations

William R Milligan et al. Genetics. 2022.

. 2022 Jul 30;221(4):iyac087.

doi: 10.1093/genetics/iyac087.

Authors

William R Milligan¹, Guy Amster^{1

2}, Guy Sella^{1

3}

Affiliations

¹ Department of Biological Sciences, Columbia University, New York, NY 10027, USA.
² Flatiron Health Inc., New York, NY 10013, USA.
³ Program for Mathematical Genomics, Columbia University, New York, NY 10032, USA.

PMID: 35666194
PMCID: PMC9339295
DOI: 10.1093/genetics/iyac087

Abstract

Mutation rates and spectra differ among human populations. Here, we examine whether this variation could be explained by evolution at mutation modifiers. To this end, we consider genetic modifier sites at which mutations, "mutator alleles," increase genome-wide mutation rates and model their evolution under purifying selection due to the additional deleterious mutations that they cause, genetic drift, and demographic processes. We solve the model analytically for a constant population size and characterize how evolution at modifier sites impacts variation in mutation rates within and among populations. We then use simulations to study the effects of modifier sites under a plausible demographic model for Africans and Europeans. When comparing populations that evolve independently, weakly selected modifier sites (2Nes≈1), which evolve slowly, contribute the most to variation in mutation rates. In contrast, when populations recently split from a common ancestral population, strongly selected modifier sites (2Nes≫1), which evolve rapidly, contribute the most to variation between them. Moreover, a modest number of modifier sites (e.g. 10 per mutation type in the standard classification into 96 types) subject to moderate to strong selection (2Nes>1) could account for the variation in mutation rates observed among human populations. If such modifier sites indeed underlie differences among populations, they should also cause variation in mutation rates within populations and their effects should be detectable in pedigree studies.

Keywords: germline mutations; human evolution; modifiers; mutation rate; mutation spectra; mutators; pedigree studies; population genetics.

PubMed Disclaimer

Figures

**Fig. 1.**
Effect of modifier sites on (a) expected mutation rates and (b) variance of mean mutation rates. We estimate both quantities and their standard error (SE) for each combination of $M$ and the scaled selection parameter $4 N Lhs ϕ$ (varying $ϕ$ ) from simulations (see *Simulations*). In most cases, the SEs are too small to see and therefore not shown. Analytical results are calculated from Equations (13) and (14). We assume $N = 2 \times 10^{3}$ with other parameters chosen to match population-scaled values in humans (see *Parameters* and Table 1).

**Fig. 2.**
Upper bound on the number of modifier sites (a) and on variance in mean mutation rates (b), assuming that the expected mutation rate equals the estimated rate in humans. We show both quantities for modifier sites that affect one of 96 mutation types (dashed lines) and for those that affect all mutation types (solid lines). For modifier sites that only affect a single mutation type, we show both their effect on variance at sites they affect and on the total mutation rate. Each quantity was calculated using Equations (16) and (17). We assumed scaled parameter values roughly based on humans with $N = 2 \times 10^{4}$ (see *Parameters* and Table 1).

**Fig. 3.**
Summary of parameter ranges where modifier sites generate population specific peaks in mutation rates. For a given $M$ and $ϕ$ , we show the probability that (a) a single mutation rate trajectory or (b) sets of 96 trajectories are peak-like, or the probability that (c) the largest enrichment per set is greater than 1.1 (see *Tests of variance between populations*). (d) A cartoon illustrating how we categorize trajectories based on the number of windows that are above the elevated or peaked thresholds. For individual trajectories (a), we show the probability of being multielevated (interior color). For sets of trajectories (b), we show the expected number of peak-like trajectories in a peak-like set (interior color; null values are colored white). For both (a and b), we show whether most trajectories or sets are not-peaked or multielevated (edge color). For enrichments (c), we show the 95% quantile of enrichment values (interior color). We vary $ϕ$ to span the range of selection parameters and vary $M$ between one and the maximum number of modifier sites possible, $M^{*} (ϕ)$ (Equation 16). In Supplementary Figs. 14–17, we show similar results for models that relax some of our simplifying assumptions.

**Fig. 4.**
Parameter ranges in which both enrichments and peak-like sets occur with probability $\geq 0.01$ under different models and criteria for peak-like trajectories. a) Our basic model. b) A model in which mutation types have different expected rates. c) A model in which mutator alleles are selected on due to their effects on germline and somatic mutation rates. d) A model in which mutator alleles affect multiple mutation types. For all models, we show the parameter ranges for our restrictive (dark grey) and relaxed (light grey) definitions of peak-like trajectories (see *Tests of variance in mutation rates between populations* and Supplementary Fig. 14 for definitions). See Supplementary Figs. 15–17 for additional details about and more results for each model (akin to Fig. 3). We calculate the maximal number of modifier sites for a given scaled selection parameter (which varies across models) from Equation (16).

**Fig. 5.**
Variance in mutation rates within a population as a function of the effect size of modifier sites. We show the expected variance within a population (relative to a fixed expected rate, $E (V (u| q)) / E^{2} (u)$ ) at steady state arising from: (a) one modifier site and (b) the maximum number of modifier sites conditional on a fixed expected rate (Equation 16). For simulated results with a single site (a), we estimate the average heterozygosity at modifier sites in simulations with $M = 10^{3}$ (calculated as described in *Simulations*) and the contribution to variance within a population relying on Equation (11). When assuming the maximum number of modifier sites (b), this estimated contribution and its SE are scaled by $M^{*} (ϕ)$ (based on Equation (16) and assuming $\hat{u} = 1.25 \times 10^{- 7} / bp / gen$ , the estimated human mutation rate scaled for population size). In most cases, the SEs are too small to see. We calculate the analytical predictions from the expected heterozygosity, which is calculated by integrating over the stationary distribution based on Equation (15); we also show the results of the simple approximations described in the text. For comparison, we show the corresponding variance between populations (gray curve) based on Equation (14). Both simulated and analytical results assume $N = 2 \times 10^{3}$ with other parameters chosen to match population-scaled values in humans (see *Parameters* and Table 1). These results are insensitive to plausible variation in model parameters (Supplementary Fig. 3).

**Fig. 6.**
Identifying the effects of mutator alleles in a pedigree study of mutation rates. (a) The expected number of de novo mutations (± 1 SD) in trios with and without a mutator. (b) The expected number of mutators sampled (± 1 SD) in a study with 1,500 trios, based on our steady state approximations [Equation (8) and Supplementary Section 6]. The vertical red lines denote the minimum effect size at which we expect a mutator to be detectable (with $P \leq 0.05)$ . For the purpose of this illustration, we assume a simple null model of mutations in the absence of mutators and ignore potential complications. We further assume that mutators affect all mutations (instead of, e.g. a specific mutation type) and rely on parameter values that are plausible for humans (Table 1) while varying $ϕ$ .

See this image and copyright information in PMC

References

1. Aarnio M, Sankila R, Pukkala E, Salovaara R, Aaltonen LA, de la Chapelle A, Peltomäki P, Mecklin J-P, Järvinen HJ.. Cancer risk in mutation carriers of DNA-mismatch-repair genes. Int J Cancer. 1999;81(2):214–218. doi:10.1002/(SICI)1097-0215(19990412)81:2<214::AID-IJC8>3.0.CO;2-L. - DOI - PubMed
1. Aikens RC, Johnson KE, Voight BF.. Signals of variation in human mutation rate at multiple levels of sequence context. Mol Biol Evol. 2019;36(5):955–965. doi:10.1093/molbev/msz023. - DOI - PMC - PubMed
1. Alexandrov LB, Kim J, Haradhvala NJ, Huang MN, Tian Ng AW, Wu Y, Boot A, Covington KR, Gordenin DA, Bergstrom EN, et al.; PCAWG Consortium. The repertoire of mutational signatures in human cancer. Nature. 2020;578(7793):94–101. doi:10.1038/s41586-020–1943-3. - PMC - PubMed
1. Anandakrishnan R, Varghese RT, Kinney NA, Garner HR.. Estimating the number of genetic mutations (hits) required for carcinogenesis based on the distribution of somatic mutations. PLoS Comput Biol. 2019;15(3):e1006881. doi:10.1371/journal.pcbi.1006881. - DOI - PMC - PubMed
1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al.Gene ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–29. doi:10.1038/75556. - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The impact of genetic modifiers on variation in germline mutation rates within and among human populations

Affiliations

The impact of genetic modifiers on variation in germline mutation rates within and among human populations

Authors

Affiliations

Abstract

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources