Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Oct 15;31(20):6043-52.
doi: 10.1093/nar/gkg784.

A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate

Affiliations

A compositional segmentation of the human mitochondrial genome is related to heterogeneities in the guanine mutation rate

David C Samuels et al. Nucleic Acids Res. .

Abstract

We applied a hidden Markov model segmentation method to the human mitochondrial genome to identify patterns in the sequence, to compare these patterns to the gene structure of mtDNA and to see whether these patterns reveal additional characteristics important for our understanding of genome evolution, structure and function. Our analysis identified three segmentation categories based upon the sequence transition probabilities. Category 2 segments corresponded to the tRNA and rRNA genes, with a greater strand-symmetry in these segments. Category 1 and 3 segments covered the protein- coding genes and almost all of the non-coding D-loop. Compared to category 1, the mtDNA segments assigned to category 3 had much lower guanine abundance. A comparison to two independent databases of mitochondrial mutations and polymorphisms showed that the high substitution rate of guanine in human mtDNA is largest in the category 3 segments. Analysis of synonymous mutations showed the same pattern. This suggests that this heterogeneity in the mutation rate is partly independent of respiratory chain function and is a direct property of the genome sequence itself. This has important implications for our understanding of mtDNA evolution and its use as a 'molecular clock' to determine the rate of population and species divergence.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Sequence transition probability distributions between nucleotides for the three segmentation categories. Red: category 1. Blue: category 2. Green: category 3.
Figure 2
Figure 2
Probability map of assignment of the human mtDNA genome to the three segmentation categories. Protein gene annotations are given in (a) for categories 1 and 3. tRNA and rRNA annotations are given in (b) for category 2. The arrows give the direction of translation of the gene. Color codes are the same as in Figure 1.
Figure 3
Figure 3
A comparison of the positions of the category 3 segments, shaded in green, to (A) guanine distribution and (B) predicted hydrophobicity. The guanine abundance was calculated using a sliding window of length 201 bp. The predicted hydrophobicity was calculated using the Kyte-Doolittle scale averaged over a sliding window of 21 amino acids and is plotted as a function of the base pair position of the codons for each amino acid. Due to the rapid variations in predicted hydrophobicity, only three genes (ND1, ND2 and COX I) are plotted. No relationship between the predicted protein hydrophobicity and the category 3 locations is apparent in this plot, but the location of the category 3 segments clearly corresponds to sequence regions with low guanine abundance. This plot shows that the low guanine content of category 3 is true for each segment individually, not just for the average taken over all the segments.
Figure 3
Figure 3
A comparison of the positions of the category 3 segments, shaded in green, to (A) guanine distribution and (B) predicted hydrophobicity. The guanine abundance was calculated using a sliding window of length 201 bp. The predicted hydrophobicity was calculated using the Kyte-Doolittle scale averaged over a sliding window of 21 amino acids and is plotted as a function of the base pair position of the codons for each amino acid. Due to the rapid variations in predicted hydrophobicity, only three genes (ND1, ND2 and COX I) are plotted. No relationship between the predicted protein hydrophobicity and the category 3 locations is apparent in this plot, but the location of the category 3 segments clearly corresponds to sequence regions with low guanine abundance. This plot shows that the low guanine content of category 3 is true for each segment individually, not just for the average taken over all the segments.

References

    1. Anderson S., Bankier,A.T., Barrell,B.G., de Bruijn,M.H., Coulson,A.R., Drouin,J., Eperon,I.C., Nierlich,D.P., Roe,B.A., Sanger,F. et al. (1981) Sequence and organization of the human mitochondrial genome. Nature, 290, 457–465. - PubMed
    1. Mishmar D., Ruiz-Pesini,E., Golik,P., Macaulay,V., Clark,A.G., Hosseini,S., Brandon,M., Easley,K., Chen,E., Brown,M.D. et al. (2003) Natural selection shaped regional mtDNA variation in humans. Proc. Natl Acad. Sci. USA, 100, 171–176. - PMC - PubMed
    1. Wallace D.C. (1994) Mitochondrial DNA sequence variation in human evolution and disease. Proc. Natl Acad. Sci. USA, 91, 8739–8746. - PMC - PubMed
    1. Wilson M.R., DiZinno,J.A., Polanskey,D., Replogle,J. and Budowle,B. (1995) Validation of mitochondrial DNA sequencing for forensic casework analysis. Int. J. Legal Med., 108, 68–74. - PubMed
    1. Wallace D.C. (1999) Mitochondrial diseases in mouse and man. Science, 283, 1482–1488. - PubMed

Publication types