Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar;79(3):839-52.
doi: 10.1002/prot.22922. Epub 2010 Dec 6.

Predicting protein flexibility through the prediction of local structures

Affiliations

Predicting protein flexibility through the prediction of local structures

Aurélie Bornot et al. Proteins. 2011 Mar.

Abstract

Protein structures are valuable tools for understanding protein function. However, protein dynamics is also considered a key element in protein function. Therefore, in addition to structural analysis, fully understanding protein function at the molecular level now requires accounting for flexibility. However, experimental techniques that produce both types of information simultaneously are still limited. Prediction approaches are useful alternative tools for obtaining otherwise unavailable data. It has been shown that protein structure can be described by a limited set of recurring local structures. In this context, we previously established a library composed of 120 overlapping long structural prototypes (LSPs) representing fragments of 11 residues in length and covering all known local protein structures. On the basis of the close sequence-structure relationship observed in LSPs, we developed a novel prediction method that proposes structural candidates in terms of LSPs along a given sequence. The prediction accuracy rate was high given the number of structural classes. In this study, we use this methodology to predict protein flexibility. We first examine flexibility according to two different descriptors, the B-factor and root mean square fluctuations from molecular dynamics simulations. We then show the relevance of using both descriptors together. We define three flexibility classes and propose a method based on the LSP prediction method for predicting flexibility along the sequence. The prediction rate reaches 49.6%. This method competes rather efficiently with the most recent, cutting-edge methods based on true flexibility data learning with sophisticated algorithms. Accordingly, flexibility information should be taken into account in structural prediction assessments.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Normalised B-factor values according to normalized RMSF values as determined from molecular dynamics simulations
The two diagonal lines delimit the three flexibility classes defined by the quadruplet (τB1, τF1, τB2, τF2) = (−1.5, −0.5, 2.2, 1.1)
Figure 2
Figure 2. Relationship between the mean B-factorNorm mB and the mean RMSFNorm mF per LSP class
Dot colour represents the secondary structure LSP category, with helical, extended core, connection and extended edge LSPs in black, red, green and blue, respectively. The black line is the first bisector. The brown dashed line gives the regression line.
Figure 3
Figure 3. Relationship between observed flexibility and prediction rate for each LSP class
Flexibility was measured by the mean RMSFNorm mF. Dot colour represents the secondary structure LSP category, with helical, extended, connection and extended edge LSPs in black, red, green and blue, respectively. The brown dashed line gives regression line.
Figure 4
Figure 4. Flexibility prediction on the rat intestinal fatty acid-binding protein
(PDB code 1IFC, 131 residues). Top: observed and predicted B-factorNorm values, bottom: observed and predicted RMSFNorm values. Black dotted lines indicated observed values and red lines indicate predicted values. Outlier values are symbolised by triangles.
Figure 5
Figure 5. Observed and predicted flexibility descriptors mapped on the rat intestinal fatty acid-binding protein structure
(PDB code 1IFC). A. Normalized B-factors, B. Predicted B-factors, C. Normalized RMSF from molecular dynamics simulations, D. Predicted RMSF.
Figure 6
Figure 6. Flexibility and prediction rate according to the local structure prediction confidence index (CI)
Flexibility descriptors are given on the left y-axis. RMSFNorm and B-factorNorm averages according to CI categories are indicated by blue and green lines, respectively. Prediction rates are given on the right y-axis. Local structure prediction rates according to CI are indicated in black and flexibility prediction rates obtained with the quadruplet defined on the whole dataset are in red.

References

    1. Noble ME, Endicott JA, Johnson LN. Protein kinase inhibitors: insights into drug design from structure. Science. 2004;303:1800–1805. - PubMed
    1. Blundell TL, Sibanda BL, Montalvao RW, Brewerton S, Chelliah V, Worth CL, Harmer NJ, Davies O, Burke D. Structural biology and bioinformatics in drug design: opportunities and challenges for target identification and lead discovery. Philos Trans R Soc Lond B Biol Sci. 2006;361:413–423. - PMC - PubMed
    1. Wendt KU, Weiss MS, Cramer P, Heinz DW. Structures and diseases. Nat Struct Mol Biol. 2008;15:117–120. - PMC - PubMed
    1. Eisenmesser EZ, Millet O, Labeikovsky W, Korzhnev DM, Wolf-Watz M, Bosco DA, Skalicky JJ, Kay LE, Kern D. Intrinsic dynamics of an enzyme underlies catalysis. Nature. 2005;438:117–121. - PubMed
    1. Boehr DD, McElheny D, Dyson HJ, Wright PE. The dynamic energy landscape of dihydrofolate reductase catalysis. Science. 2006;313:1638–1642. - PubMed

Publication types