Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb 26;11(3):357.
doi: 10.3390/biom11030357.

Statistical Evidence for a Helical Nascent Chain

Affiliations

Statistical Evidence for a Helical Nascent Chain

Leonor Cruzeiro et al. Biomolecules. .

Abstract

We investigate the hypothesis that protein folding is a kinetic, non-equilibrium process, in which the structure of the nascent chain is crucial. We compare actual amino acid frequencies in loops, α-helices and β-sheets with the frequencies that would arise in the absence of any amino acid bias for those secondary structures. The novel analysis suggests that while specific amino acids exist to drive the formation of loops and sheets, none stand out as drivers for α-helices. This favours the idea that the α-helix is the initial structure of most proteins before the folding process begins.

Keywords: folding pathway; protein folding; secondary structure prediction; single amino acid distributions.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Average abundance, f¯a(a) (cf. Equation (3)), of amino acid a, in the protein set used. The values are given in percentage of the total number of amino acids (see text).
Figure 2
Figure 2
Distributions/histograms for a few of the frequencies f(a,s,p), where the amino acid a is specified at the top of each plot and where red is for s=α-helix, blue is for s=β-sheets and green is for s= loops. In this figure, the variable f(a,s,p) (the x-coordinate) runs from zero (which means that none of the amino acids a are found in s) to 30 (which means that 30% of the amino acids in the protein are a’s found in s). The y-coordinate is proportional to the number of proteins with a given value of f(a,s,p). The scale of the y-coordinate is the same for all plots and all distributions are normalised. The horizontal dotted lines are the FWHM of the distributions (see text).
Figure 3
Figure 3
Average frequency of finding an amino acid a in α-helices (top plot), β-strands (middle plot) and loops (bottom plot). The values are given in percentage for each secondary structure, i.e., summing all the values in each line leads to 100. The amino acids are specified by their one letter codes.
Figure 4
Figure 4
Distributions/histograms for a few of the ratios R(a,s,p) (cf. Equation (10)), where the amino acid a is specified at the top of each plot and where red is for s=α-helix, blue is for s=β-sheets and green is for s= loops. The variable R(a,s,p) (the x-coordinate) runs from zero to four, the scale of the ordinates is the same in all plots and all histograms are normalised. The vertical dotted line marks the value R(a,s,p)=1, when the actual number of amino acids a in secondary structure s is equal to what is expected in the absence of any correlation between a and s. The horizontal dotted lines are the FWHM of the distributions (see text).
Figure 5
Figure 5
Most probable values for the ratios R(a,s,p) (cf. Equation (10)) for α-helices (top plot, red), β-sheets (middle plot, blue) and loops (bottom plot, green). The most probable value is taken from the full distributions for each ratio R(a,s,p) and the uncertainty around that value is given by the FWHM.
Figure 6
Figure 6
Most probable values for the ratios R(a,s,p) (cf. Equation (10)) for α-helices (top plot, red), β-sheets (middle plot, blue) and loops (bottom plot, green). The most probable value is taken from the middle peak in the distributions for each ratio R(a,s,p) and the uncertainty around that value is given by the FWHM of that middle peak.

Similar articles

Cited by

References

    1. Chou P.Y., Fasman G.D. Conformational Parameters for Amino Acids in Helical, β-Sheet, and Random Coil Regions Calculated from Proteins. Biochemistry. 1974;3:211–222. doi: 10.1021/bi00699a001. - DOI - PubMed
    1. Haber E., Anfinsen C.B. Side-chain interactions governing the pairing of half-cystine residues in ribonuclease. J. Biol. Chem. 1962;237:1839–1844. doi: 10.1016/S0021-9258(19)73945-3. - DOI - PubMed
    1. Anfinsen C.B. Principles that Govern the Folding of Protein Chains. Science. 1973;181:223–230. doi: 10.1126/science.181.4096.223. - DOI - PubMed
    1. Dill K.A., Chan H.S. From Levinthal to pathways to funnels. Nat. Struct. Biol. 1997;4:10–19. doi: 10.1038/nsb0197-10. - DOI - PubMed
    1. Onuchic J.N., Luthey-Schulten Z., Wolynes P.G. Theory of protein folding: The energy landscape perspective. Annu. Rev. Phys. Chem. 1997;48:545–600. doi: 10.1146/annurev.physchem.48.1.545. - DOI - PubMed

Publication types

LinkOut - more resources