Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jun 4;9(6):e95034.
doi: 10.1371/journal.pone.0095034. eCollection 2014.

Unraveling patterns of site-to-site synonymous rates variation and associated gene properties of protein domains and families

Affiliations

Unraveling patterns of site-to-site synonymous rates variation and associated gene properties of protein domains and families

Slavica Dimitrieva et al. PLoS One. .

Erratum in

  • PLoS One. 2014;9(7):e102721

Abstract

In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Bootstrap distribution of the differences in A) the mean ω-ratio, B) tRNA reusage, measured through tRNA Pairing Index (TPI), C) number of interactions and D) number of structures, between protein groups having site-to-site variation in synonymous rates (SRV+) and protein groups having constant synonymous rates (SRV−).
The plots B), C) and D) also show the bootstrap distributions of the corresponding differences between protein groups showing evidence for positive selection (PS+) and those failing to show such evidence (PS−). All differences (except for TPI in PS+/PS− data) are significant since 95% of the histogram area does not include the zero value.
Figure 2
Figure 2. Hierarchical clustering of human disease and environmental information processing pathways in respect to the SRV+ genes that are shared between the pathways.
The bars next to the pathways denote the number of SRV+ genes (red) and SRV- genes (green) in the corresponding pathways. Cancer related pathways are marked in blue; metabolic disease pathways are in purple. Note that ABC transporters and Type II diabetes mellitus pathways are exclusively composed of SRV+ genes.
Figure 3
Figure 3. Distribution of the expression levels in A) SRV− genes (blue) and SRVEXT genes (red) and B) PS− genes (green) and PS+ genes (purple) for different tissues.
SRVEXT genes show higher expression levels compared to SRV− genes; PS+ genes show reduced expression levels compared to PS− genes.

References

    1. Anfinsen CB (1973) Principles that govern the folding of protein chains. Science 181: 223–230. - PubMed
    1. Clarke B (1970) Darwinian evolution of proteins. Science 168: 1009–1011. - PubMed
    1. Ikemura T (1981) Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes. J Mol Biol 146: 1–21. - PubMed
    1. Akashi H, Eyre-Walker A (1998) Translational selection and molecular evolution. Curr Opin Genet Dev 8: 688–693. - PubMed
    1. Duret L (2002) Evolution of synonymous codon usage in metazoans. Curr Opin Genet Dev 12: 640–649. - PubMed

Publication types