Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Nov;180(3):1511-24.
doi: 10.1534/genetics.108.091116. Epub 2008 Sep 14.

Maximum-likelihood estimation of site-specific mutation rates in human mitochondrial DNA from partial phylogenetic classification

Collaborators, Affiliations

Maximum-likelihood estimation of site-specific mutation rates in human mitochondrial DNA from partial phylogenetic classification

Saharon Rosset et al. Genetics. 2008 Nov.

Abstract

The mitochondrial DNA hypervariable segment I (HVS-I) is widely used in studies of human evolutionary genetics, and therefore accurate estimates of mutation rates among nucleotide sites in this region are essential. We have developed a novel maximum-likelihood methodology for estimating site-specific mutation rates from partial phylogenetic information, such as haplogroup association. The resulting estimation problem is a generalized linear model, with a nonstandard link function. We develop inference and bias correction tools for our estimates and a hypothesis-testing approach for site independence. We demonstrate our methodology using 16,609 HVS-I samples from the Genographic Project. Our results suggest that mutation rates among nucleotide sites in HVS-I are highly variable. The 16,400-16,500 region exhibits significantly lower rates compared to other regions, suggesting potential functional constraints. Several loci identified in the literature as possible termination-associated sequences (TAS) do not yield statistically slower rates than the rest of HVS-I, casting doubt on their functional importance. Our tests do not reject the null hypothesis of independent mutation rates among nucleotide sites, supporting the use of site-independence assumption for analyzing HVS-I. Potential extensions of our methodology include its application to estimation of mutation rates in other genetic regions, like Y chromosome short tandem repeats.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
(A) Schematic of the Hg view of a phylogenetic tree and (B) the full phylogenetic tree, including the internal Hg phylogenies, which we assume we do not observe.
F<sc>igure</sc> 2.—
Figure 2.—
Demonstration of our reasoning, that we know whether any mutations have occurred in a specific site.
F<sc>igure</sc> 3.—
Figure 3.—
The poly-C stretch. Position 16,189 is highlighted and five sequences are shown. A sequence identical to the rCRS in the presented region is shown at the top. Below it, four sequences containing the T16,189C polymorphism are arranged to show one to four adenosines preceding the poly-C stretch. A typical chromatogram of the sequence after the poly-C stretch is also demonstrated.
F<sc>igure</sc> 4.—
Figure 4.—
Graphical representation of mutation rates along HVS-I.
F<sc>igure</sc> 5.—
Figure 5.—
Smoothed bias estimation curves for our various estimation protocols, using our simulation–bootstrap hybrid. The smoothing was done using LOESS (Cleveland et al. 1992).
F<sc>igure</sc> 5.—
Figure 5.—
Smoothed bias estimation curves for our various estimation protocols, using our simulation–bootstrap hybrid. The smoothing was done using LOESS (Cleveland et al. 1992).
F<sc>igure</sc> 5.—
Figure 5.—
Smoothed bias estimation curves for our various estimation protocols, using our simulation–bootstrap hybrid. The smoothing was done using LOESS (Cleveland et al. 1992).
F<sc>igure</sc> 5.—
Figure 5.—
Smoothed bias estimation curves for our various estimation protocols, using our simulation–bootstrap hybrid. The smoothing was done using LOESS (Cleveland et al. 1992).
F<sc>igure</sc> 6.—
Figure 6.—
Comparison of the estimates (black circles) and confidence intervals from two of our variants and Bandelt et al. (2006). Note that the y-axis is on a logarithmic scale.

References

    1. Bandelt, H., L. Quintana-Murci, A. Salas and V. Macaulay, 2002. The fingerprint of phantom mutations in mitochondrial DNA data. Am. J. Hum. Genet. 71(5): 1150–1160. - PMC - PubMed
    1. Bandelt, H. J., Q. P. Kong, M. Richards and V. Macaulay, 2006. Estimation of mutation rates and coalescence times: some caveats, pp. 47–90 in Human Mitochondrial DNA and the Evolution of Homo sapiens, edited by H. J. Bandelt, V. Macaulay and M. Richards. Springer, Berlin.
    1. Behar, D. M., S. Rosset, J. Blue-Smith, O. Balanovsky, S. Tzur et al., 2007. The genographic project public participation mitochondrial DNA database. PLoS Genet. 3(6): e104. - PMC - PubMed
    1. Benjamini, Y., and Y. Hochberg, 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57 289–300.
    1. Bickel, P., F. Gotze and W. van Zwet, 1997. Resampling fewer than n observations: gains, losses and remedies for losses. Stat. Sin. 7 1–31.

Publication types

Substances