Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Sep 10;15(9):15963-80.
doi: 10.3390/ijms150915963.

Neighbor preferences of amino acids and context-dependent effects of amino acid substitutions in human, mouse, and dog

Affiliations

Neighbor preferences of amino acids and context-dependent effects of amino acid substitutions in human, mouse, and dog

Mingchuan Fu et al. Int J Mol Sci. .

Abstract

Amino acids show apparent propensities toward their neighbors. In addition to preferences of amino acids for their neighborhood context, amino acid substitutions are also considered to be context-dependent. However, context-dependence patterns of amino acid substitutions still remain poorly understood. Using relative entropy, we investigated the neighbor preferences of 20 amino acids and the context-dependent effects of amino acid substitutions with protein sequences in human, mouse, and dog. For 20 amino acids, the highest relative entropy was mostly observed at the nearest adjacent site of either N- or C-terminus except C and G. C showed the highest relative entropy at the third flanking site and periodic pattern was detected at G flanking sites. Furthermore, neighbor preference patterns of amino acids varied greatly in different secondary structures. We then comprehensively investigated the context-dependent effects of amino acid substitutions. Our results showed that nearly half of 380 substitution types were evidently context dependent, and the context-dependent patterns relied on protein secondary structures. Among 20 amino acids, P elicited the greatest effect on amino acid substitutions. The underlying mechanisms of context-dependent effects of amino acid substitutions were possibly mutation bias at a DNA level and natural selection. Our findings may improve secondary structure prediction algorithms and protein design; moreover, this study provided useful information to develop empirical models of protein evolution that consider dependence between residues.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Neighbor preference pattern of the amino acid A. (A) Relative entropies of the neighboring sites of A of N- and C-termini. The vertical coordinate represents the value of relative entropy, and the horizontal coordinate represents the relative distance to the 0 site. The flanking sites of N-terminus are indicated by negative numbers and the flanking sites of C-terminus are indicated by positive numbers. The gray lines are the thresholds representing 0.001 significance level. The standard deviation of the relative entropy in each site is indicated by black error bars; (B) Boxplot of the 20 relative entropies calculated by formula image at each flanking site of the amino acid A. The bold line in each box represents the median of the 20 values. The top and bottom lines of each box indicate the upper and lower quartiles, respectively. The upper and lower whiskers represent the largest data point which was less than the sum of the upper quartile plus 1.5 times the interquartile range (IQR), and the lowest data point which was greater than the lower quartile minus 1.5 IQR, respectively. In order to determine which amino acids were apparently preferred (or not preferred) in the neighboring sites, the outliers were represented by the corresponding one letter codes of amino acids.
Figure 2
Figure 2
Neighbor preference pattern of the amino acid C. (A) Relative entropies of the neighboring sites of C of N- and C-termini; (B) Boxplot of the 20 relative entropies at each flanking site of the amino acid C.
Figure 3
Figure 3
Neighbor preference pattern of the amino acid G. (A) Relative entropies of the neighboring sites of G of N- and C-termini; (B) Boxplot of the 20 relative entropies at each flanking site of the amino acid G.
Figure 4
Figure 4
Neighbor preference patterns of V, T, and E in different secondary structures. (A) Neighbor preference pattern of the amino acid V in α-helix; (B) Neighbor preference pattern of the amino acid T in β-strand; and (C) Neighbor preference pattern of the amino acid E in coil. (Boxplots of these amino acids can be found in Figures S2–S4).
Figure 5
Figure 5
Context-dependence pattern of SP substitution. (A) Relative entropies of the neighboring sites of the SP substitution of N- and C-termini; (B) Boxplot of the 20 relative entropies at each flanking site of the SP substitution.
Figure 6
Figure 6
Context-dependence pattern of GE substitution in α-helix and coil. (A) Context-dependence pattern of the GE substitution in α-helix; (B) Context-dependence pattern of the GE substitution in coil. (Boxplots of this amino acid substitution type in α-helix and coil can be found in Figures S6 and S7).

Similar articles

Cited by

References

    1. Dwyer D.S. Nearest-neighbor effects and structural preferences in dipeptides are a function of the electronic properties of amino acid side-chains. Proteins. 2006;63:939–948. doi: 10.1002/prot.20906. - DOI - PubMed
    1. Chou P.Y., Fasman G.D. Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. Biochemistry. 1974;13:211–222. doi: 10.1021/bi00699a001. - DOI - PubMed
    1. Chou P.Y. Prediction of the secondary structure of proteins from their amino acid sequence. Adv. Enzymol. Relat. Areas Mol. Biol. 1978;47:45–148. - PubMed
    1. Chou P.Y., Fasman G.D. Empirical predictions of protein conformation. Annu. Rev. Biochem. 1978;47:251–276. doi: 10.1146/annurev.bi.47.070178.001343. - DOI - PubMed
    1. Chakrabartty A., Kortemme T., Baldwin R.L. Helix propensities of the amino acids measured in alanine-based peptides without helix-stabilizing side-chain interactions. Protein Sci. 1994;3:843–852. doi: 10.1002/pro.5560030514. - DOI - PMC - PubMed

Publication types

LinkOut - more resources