Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul;29(7):609-18.
doi: 10.1007/s10822-015-9839-2. Epub 2015 Mar 26.

Statistical dictionaries for hypothetical in silico model of the early-stage intermediate in protein folding

Affiliations

Statistical dictionaries for hypothetical in silico model of the early-stage intermediate in protein folding

Barbara Kalinowska et al. J Comput Aided Mol Des. 2015 Jul.

Abstract

The polypeptide chain folding process appears to be a multi-stage phenomenon. The scientific community has recently devoted much attention to early stages of this process, with numerous attempts at simulating them--either experimentally or in silico. This paper presents a comparative analysis of the predicted and observed results of folding simulations. The proposed technique, based on statistical dictionaries, yields a global accuracy of 57%--a marked improvement over older approaches (with an accuracy of approximately 46%).

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Conformational subspace represented by an elliptical path on the Ramachandran plot (gray line), with areas corresponding to local probability distribution maxima of (φ e, ψ e) angle pairs obtained through minimum-distance projections (black lines). The black arrow depicts a sample projection (φ i, ψ i) → (φ ei, ψ ei)
Fig. 2
Fig. 2
Probability distribution profiles for (φ e, ψ e) values for histidine, along with structural codes corresponding to individual maxima. The t parameter traverses the elliptical path starting with one of its poles which is located in the bottom right-hand corner of the Ramachandran plot. Bars represent the specific placement of all seven probability maxima which are used to express the early stage intermediate structure
Fig. 3
Fig. 3
Comparison of prediction accuracy between statistical dictionaries method (DIC) and contingency table approach (based on the Maximum Probability in contingency table—MP) for amino acid residues and individual structural codes. The list of residues is given on the bottom line. The zones on Ramachandran map is represented according to symbols ag. The c—represents the helical area, e and f the β-structural forms and g—left helical area. The codes a, b and d traditionally are treated as Random Coil
Fig. 4
Fig. 4
2VBL structure (A chain) a native structure derived from PDB, b structure obtained by projecting each (φ, ψ) angle pair onto the elliptical path which represents the ES conformational subspace, c ES structure obtained using the statistical dictionary method, d ES structure obtained using the contingency table method. Blue, red and green fragments correspond to residues which form α-helixes, β-twists and loops respectively. Source: PyMOL
Fig. 5
Fig. 5
2JEK structure (A chain) a native structure derived from PDB, b structure obtained by projecting each (φ, ψ) angle pair onto the elliptical path which represents the ES conformational subspace, c ES structure obtained using the statistical dictionary method, d ES structure obtained using the contingency table method. Blue, red and green fragments correspond to residues which form α-helixes, β-twists and loops respectively. Source: PyMOL
Fig. 6
Fig. 6
2VAD structure (A chain) a native structure derived from PDB, b structure obtained by projecting each (φ, ψ) angle pair onto the elliptical path which represents the ES conformational subspace, c ES structure obtained using the statistical dictionary method, d ES structure obtained using the contingency table method. Blue, red and green fragments correspond to residues which form α-helixes, β-twists and loops respectively. Source: PyMOL

References

    1. Bystroff C, Shao Y. Modeling protein folding pathways. In: Bujnicki J, editor. Practical bioinformatics. Berlin: Springer; 2004. pp. 97–122.
    1. Feng H, Zhou Z, Bai Y. A protein folding pathway with multiple folding intermediates at atomic resolution. Proc Natl Acad Sci USA. 2005;102:5026–5031. doi: 10.1073/pnas.0501372102. - DOI - PMC - PubMed
    1. Galzitskaya OV, Ivankov DN, Finkelstein AV. Folding nuclei in proteins. FEBS Lett. 2001;489:113–118. doi: 10.1016/S0014-5793(01)02092-0. - DOI - PubMed
    1. Grantcharova VP, Baker D. Folding dynamics of the src SH3 domain. Biochemistry. 1997;36:15685–15692. doi: 10.1021/bi971786p. - DOI - PubMed
    1. Jha SK, Marqusee S. Kinetic evidence for a two-stage mechanism of protein denaturation by guanidinium chloride. Proc Natl Acad Sci USA. 2014;111:4856–4861. doi: 10.1073/pnas.1315453111. - DOI - PMC - PubMed

Publication types

LinkOut - more resources