Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2003 May;13(5):838-44.
doi: 10.1101/gr.1152803.

Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes

Affiliations
Comparative Study

Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes

Sankar Subramanian et al. Genome Res. 2003 May.

Abstract

Point mutation rates in exons (synonymous sites) and noncoding (introns and intergenic) regions are generally assumed to be the same. However, comparative sequence analyses of synonymous substitutions in exons (81 genes) and that of long intergenic fragments (141.3 kbp) of human and chimpanzee genomes reveal a 30%-60% higher mutation rate in exons than in noncoding DNA. We propose a differential CpG content hypothesis to explain this fundamental, and seemingly unintuitive, pattern. We find that the increased exonic rate is the result of the relative overabundance of synonymous sites involved in CpG dinucleotides, as the evolutionary divergence in non-CpG sites is similar in noncoding DNA and synonymous sites of exons. Expectations and predictions of our hypothesis are confirmed in comparisons involving more distantly related species, including human-orangutan, human-baboon, and human-macaque. Our results suggest an underlying mechanism for higher mutation rate in GC-rich genomic regions, predict nonlinear accumulation of mutations in pseudogenes over time, and provide a possible explanation for the observed higher diversity of single nucleotide polymorphisms (SNPs) in the synonymous sites of exons compared to the noncoding regions.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) Evolutionary divergence per 100 sites and (B) CpG contents of synonymous sites in exons, pseudogenes, introns, and intergenic DNA for the human–chimpanzee comparison. The mean and the standard error estimates shown were obtained from 81 protein coding genes, 19 pseudogenes, introns of 48 genes, and four intergenic blocks. In (A), the total height of each column (including black and open bars) corresponds to the overall evolutionary divergence. The black portion in each column depicts divergence with CpG sites excluded, and therefore, the open bars show the contribution of mutations at CpG sites to the overall divergence. In (B), the CpG content is the proportion of sites involved in the CpG dinucleotide configuration. For noncoding DNA, all sites were included for the estimation of CpG content and evolutionary divergence, whereas for exons only the fourfold-degenerate sites were included (see Methods).
Figure 2.
Figure 2.
Relationship between evolutionary divergence and the ratio of CpG contents in pseudogenes and their functional counterparts. (A) Analysis of 39 human pseudogenes [R2 = 0.52; P < 0.01], (B) seven pseudogenes created from the same Arginino succinate synthetase gene [R2 = 0.90; P < 0.01], and (C) five pseudogenes of the same Beta tubulin Q gene [R2 = 0.73; P < 0.05]. The relative CpG content is the ratio of the overall CpG content in the pseudogene divided by overall CpG content of the functional gene.
Figure 3.
Figure 3.
(A) Relationship between evolutionary divergence and GC content, GC4, at synonymous sites of 93 genes from the human–macaque comparison; R2 = 0.125, P < 0.01. (B) Relationship between evolutionary divergence and GC4 after excluding fourfold-degenerate sites involved in CpG dinucleotides; R2 = 0.01, P > 0.05.
Figure 4.
Figure 4.
(A) Relationship between synonymous divergence in exons and evolutionary divergence in introns; R2 = 0.511, P < 0.01. (B) Relationship between synonymous divergence in exons and evolutionary divergence in introns after removing CpG sites; R2 = 0.137, P > 0.05. With the rightmost outlier excluded, R2 = 0.06, P > 0.05.

Similar articles

Cited by

References

    1. Batzer M.A., Kilroy, G.E., Richard, P.E., Shaikh, T.H., Desselle, T.D., Hoppens, C.L., and Deininger, P.L. 1990. Structure and variability of recently inserted Alu family members. Nucleic Acids Res. 18: 6793-6798. - PMC - PubMed
    1. Bergstrom T.F., Erlandsson, R., Engkvist, H., Josefsson, A., Erlich, H.A., and Gyllensten, U. 1999. Phylogenetic history of hominoid DRB loci and alleles inferred from intron sequences. Immunol. Rev. 167: 351-365. - PubMed
    1. Bielawski J.P., Dunn, K.A., and Yang, Z. 2000. Rates of nucleotide substitution and mammalian nuclear gene evolution. Approximate and maximum-likelihood methods lead to different conclusions. Genetics 156: 1299-1308. - PMC - PubMed
    1. Bird A. 1999. DNA methylation de novo. Science 286: 2287-2288. - PubMed
    1. Bird A.P. 1980. DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res. 8: 1499-1504. - PMC - PubMed

Publication types

Substances

LinkOut - more resources