Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 May;12(5):1073-86.
doi: 10.1110/ps.0236803.

Can correct protein models be identified?

Affiliations

Can correct protein models be identified?

Björn Wallner et al. Protein Sci. 2003 May.

Abstract

The ability to separate correct models of protein structures from less correct models is of the greatest importance for protein structure prediction methods. Several studies have examined the ability of different types of energy function to detect the native, or native-like, protein structure from a large set of decoys. In contrast to earlier studies, we examine here the ability to detect models that only show limited structural similarity to the native structure. These correct models are defined by the existence of a fragment that shows significant similarity between this model and the native structure. It has been shown that the existence of such fragments is useful for comparing the performance between different fold recognition methods and that this performance correlates well with performance in fold recognition. We have developed ProQ, a neural-network-based method to predict the quality of a protein model that extracts structural features, such as frequency of atom-atom contacts, and predicts the quality of a model, as measured either by LGscore or MaxSub. We show that ProQ performs at least as well as other measures when identifying the native structure and is better at the detection of correct models. This performance is maintained over several different test sets. ProQ can also be combined with the Pcons fold recognition predictor (Pmodeller) to increase its performance, with the main advantage being the elimination of a few high-scoring incorrect models. Pmodeller was successful in CASP5 and results from the latest LiveBench, LiveBench-6, indicating that Pmodeller has a higher specificity than Pcons alone.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Fraction of similarity between predicted secondary structure and the secondary structure in the model (Q3) plotted against predicted LGscore, for networks trained with Q3 (A) and without Q3 (B). The networks trained with Q3 tend to give low scores to models with low Q3 and higher scores in the region Q3 ∈ {0.7, 0.8}
Figure 2.
Figure 2.
Distribution of LGscore (A), MaxSub (B), and RMSD (C) for the training set (LiveBench-2).
Figure 3.
Figure 3.
Cumulative plot of incorrect versus correct models for different methods on the LiveBench-2 (A) and LiveBench-4 (B) data sets. The curves were smoothed using averages of seven consecutive points.
Figure 4.
Figure 4.
Cumulative plot of incorrect versus correct models for different methods combined with Pcons on the LiveBench-2 test set. To make the curves easier to analyze, they were smoothed by using averages.

References

    1. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25 3389–3402. - PMC - PubMed
    1. Bauer, A. and Beyer, A. 1994. An improved pair potential to recognize native protein folds. Proteins 18 254–261. - PubMed
    1. Bishop, C.M. 1995. Neural networks for pattern recognition. Oxford University Press, Oxford, UK.
    1. Bowie, J.U. and Eisenberg, D. 1994. An evolutionary approach to folding small α-helical proteins that uses sequence information and an empirical guiding fitness function. Proc. Natl. Acad. Sci. 91 4436–4440. - PMC - PubMed
    1. Bowie, J.U., Lüthy, R., and Eisenberg, D. 1991. A method to identify protein sequence that fold into a known three-dimensional structure. Science 253 164–170. - PubMed