Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Nov 15;12(11):e1006315.
doi: 10.1371/journal.pgen.1006315. eCollection 2016 Nov.

Multi-nucleotide de novo Mutations in Humans

Affiliations

Multi-nucleotide de novo Mutations in Humans

Søren Besenbacher et al. PLoS Genet. .

Abstract

Mutation of the DNA molecule is one of the most fundamental processes in biology. In this study, we use 283 parent-offspring trios to estimate the rate of mutation for both single nucleotide variants (SNVs) and short length variants (indels) in humans and examine the mutation process. We found 17812 SNVs, corresponding to a mutation rate of 1.29 × 10-8 per position per generation (PPPG) and 1282 indels corresponding to a rate of 9.29 × 10-10 PPPG. We estimate that around 3% of human de novo SNVs are part of a multi-nucleotide mutation (MNM), with 558 (3.1%) of mutations positioned less than 20kb from another mutation in the same individual (median distance of 525bp). The rate of de novo mutations is greater in late replicating regions (p = 8.29 × 10-19) and nearer recombination events (p = 0.0038) than elsewhere in the genome.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The correlation between the estimated mutation rate for each child and the age of the parents.
Fig 2
Fig 2. Clustering mutations.
(a) The red line shows a QQ-plot of the observed distances between all pairs of mutations (both within and between individuals) compared to the expected distances assuming independence. The green line shows a QQ plot based only on distances between mutations that occurred in the same individual. The blue line shows a QQ plot based only on distances between mutations that occurred in different individuals. (b) A histogram of the number of mutations per cluster. (c) Histogram showing the distribution of distances to the nearest mutation in the same individual.
Fig 3
Fig 3. Distribution of mutation types.
The relative distribution of different types of mutations stratified by the distance to nearest mutation in the same individual. The error bars are 95% confidence intervals.
Fig 4
Fig 4. Distribution of mutation types with sequence context.
The relative distribution of different types of mutation when the bases immediately 5’ and 3’ to the mutated base is included. Stratified based on whether the mutation clustered 10bp-20kb from another mutation or was not part of a cluster. The error bars are 95% confidence intervals.
Fig 5
Fig 5. The effect of replication timing on the mutation rate.
a) The effect of replication timing At CpG, non-CpG Strong (C or G) and Weak (A or T) sites. The y-values are mutation rates per position per year (PPPY). The x-values are wavelet-smoothed signal of replication timing calculated by the ENCODE project. Early replicating regions have high signal values and late replicating regions have low values. The bands around the points show the 95% confidence interval for each point. b) The effect of replication timing on clustering and non-clustering mutations. The y-axis shows a combined odds ratio for CpG, non-CpG-Strong and non-CpG-Weak sites calculated using the Cochran-Mantel-Haentzel method. An OR over 1 indicates that we observe most mutations in the latest replicating half of the genome. The error bars are 95% confidence intervals.

References

    1. Besenbacher S, Liu S, Izarzugaza JMG, Grove J, Belling K, Bork-Jensen J, et al. Novel variation and de novo mutation rates in population-wide de novo assembled Danish trios. Nat Comms. Nature Publishing Group; 2015;6: 5969–. - PMC - PubMed
    1. Kong A, Frigge ML, Masson G, Besenbacher S, Sulem P, Magnusson G, et al. Rate of de novo mutations and the importance of father’s age to disease risk. Nature. Nature Publishing Group; 2012;488: 471–475. 10.1038/nature11396 - DOI - PMC - PubMed
    1. Michaelson JJ, Shi Y, Gujral M, Zheng H, Malhotra D, Jin X, et al. Whole-genome sequencing in autism identifies hot spots for de novo germline mutation. Cell. 2012;151: 1431–1442. 10.1016/j.cell.2012.11.019 - DOI - PMC - PubMed
    1. Conrad DF, Keebler JEM, DePristo MA, Lindsay SJ, Zhang Y, Casals F, et al. Variation in genome-wide mutation rates within and between human families. Nature genetics. 2011;43: 712–714. 10.1038/ng.862 - DOI - PMC - PubMed
    1. Jiang Y-H, Yuen RKC, Jin X, Wang M, Chen N, Wu X, et al. Detection of Clinically Relevant Genetic Variants in Autism Spectrum Disorder by Whole-Genome Sequencing. The American Journal of Human Genetics. 2013;93: 249–263. 10.1016/j.ajhg.2013.06.012 - DOI - PMC - PubMed

LinkOut - more resources