Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 12:11:e73577.
doi: 10.7554/eLife.73577.

The Mutationathon highlights the importance of reaching standardization in estimates of pedigree-based germline mutation rates

Affiliations

The Mutationathon highlights the importance of reaching standardization in estimates of pedigree-based germline mutation rates

Lucie A Bergeron et al. Elife. .

Abstract

In the past decade, several studies have estimated the human per-generation germline mutation rate using large pedigrees. More recently, estimates for various nonhuman species have been published. However, methodological differences among studies in detecting germline mutations and estimating mutation rates make direct comparisons difficult. Here, we describe the many different steps involved in estimating pedigree-based mutation rates, including sampling, sequencing, mapping, variant calling, filtering, and appropriately accounting for false-positive and false-negative rates. For each step, we review the different methods and parameter choices that have been used in the recent literature. Additionally, we present the results from a 'Mutationathon,' a competition organized among five research labs to compare germline mutation rate estimates for a single pedigree of rhesus macaques. We report almost a twofold variation in the final estimated rate among groups using different post-alignment processing, calling, and filtering criteria, and provide details into the sources of variation across studies. Though the difference among estimates is not statistically significant, this discrepancy emphasizes the need for standardized methods in mutation rate estimations and the difficulty in comparing rates from different studies. Finally, this work aims to provide guidelines for computational and statistical benchmarks for future studies interested in identifying germline mutations from pedigrees.

Keywords: computational pipeline; evolutionary biology; genetics; genomics; mutation rate; ngs analysis; pedigree-based estimation; rhesus macaque.

PubMed Disclaimer

Conflict of interest statement

LB, SB, TT, CV, RW, AP, EA, MR, JC, HC, MH, KH, AK, EL, PM, SP, GT, AY, GZ, MS No competing interests declared

Figures

Figure 1.
Figure 1.. Detection of a de novo mutation (DNM) in a trio sample (mother, father, and offspring).
Potential candidates for DNMs are sites where approximately half of the reads (indicated as gray bars) from the offspring have a variant (indicated in green) that is absent from the parental reads.
Figure 2.
Figure 2.. Flow of the main steps to call de novo mutations (DNMs) from pedigree samples.
Each step lists the various choices in study design and methodology that might impact mutation rate estimates.
Figure 3.
Figure 3.. Candidate de novo mutations (DNMs) from the Mutationathon.
(a) The pedigree of three generations of rhesus macaques was sequenced and shared with five groups of researchers. Sequencing coverage is indicated for each individual. (b) Upset plot of the 43 candidate DNMs found in Heineken by each research group (LB: Lucie Bergeron; SB: Søren Besenbacher; CV: Cyril Versoza; TT: Tychele Turner; RW: Richard Wang) detected a total of 43 candidate DNMs in Heineken. The first six vertical bars are the candidates shared by at least four different groups. The PCR amplification and Sanger sequencing validation showed that 33 candidates were true-positive DNMs, 6 were false-positive calls (red bars), and 4 did not successfully amplify (gray bars). See Materials and methods for details on the experiment and Figure 3—source data 2 for the results of the PCR experiment.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Mutation spectrum of the trio of rhesus macaques.
’All TPs’ correspond to all true-positive de novo mutations (DNMs) validated by the PCR experiment. The different colors correspond to the true-positive DNMs found by each pipeline (LB: Lucie Bergeron; SB: Søren Besenbacher; CV: Cyril Versoza; TT: Tychele Turner; RW: Richard Wang).
Figure 4.
Figure 4.. Estimated germline mutation rates from the Mutationathon.
(a) Number of candidate de novo mutations (DNMs) found by each group (Tychele Turner found two candidates on a sex chromosome). (b) Estimation of the denominator (i.e., the callable genome corrected by the false-negative rate [FNR]) by each group. (c) Estimated mutation rate per site per generation, the error bars correspond to the confidence intervals for binomial probabilities (calculated using the R package 'binconf').
Figure 5.
Figure 5.. The impact of individual filters on the estimated rate of a trio of rhesus macaques.
The default filters used by Lucie Bergeron (LB) pipeline were DP < 0.5 × depth individual; DP > 2 × depthindividual; GQ < 60; AB < 0.3; AB > 0.7, no AD filter.

References

    1. Acinas SG, Sarma-Rupavtarm R, Klepac-Ceraj V, Polz MF. PCR-induced sequence artifacts and bias: insights from comparison of two 16S rRNA clone libraries constructed from the same sample. Applied and Environmental Microbiology. 2005;71:8966–8969. doi: 10.1128/AEM.71.12.8966-8969.2005. - DOI - PMC - PubMed
    1. Acuna-Hidalgo R, Veltman JA, Hoischen A. New insights into the generation and role of de novo mutations in health and disease. Genome Biology. 2016;17:241. doi: 10.1186/s13059-016-1110-1. - DOI - PMC - PubMed
    1. Baust JG. Strategies for the storage of DNA. Biopreservation and Biobanking. 2008;6:251–252. doi: 10.1089/bio.2008.0604.lett. - DOI - PubMed
    1. Belyeu JR, Brand H, Wang H, Zhao X, Pedersen BS, Feusier J, Gupta M, Nicholas TJ, Brown J, Baird L, Devlin B, Sanders SJ, Jorde LB, Talkowski ME, Quinlan AR. De novo structural mutation rates and gamete-of-origin biases revealed through genome sequencing of 2,396 families. American Journal of Human Genetics. 2021;108:597–607. doi: 10.1016/j.ajhg.2021.02.012. - DOI - PMC - PubMed
    1. Bergeron LA, Besenbacher S, Bakker J, Zheng J, Li P, Pacheco G, Sinding MHS, Kamilari M, Gilbert MTP, Schierup MH, Zhang G. The germline mutational process in rhesus macaque and its implications for phylogenetic dating. GigaScience. 2021a;10:1–14. doi: 10.1093/gigascience/giab029. - DOI - PMC - PubMed

Publication types