Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2019 Mar 15;35(6):937-944.
doi: 10.1093/bioinformatics/bty760.

Comparative analysis of methods for evaluation of protein models against native structures

Affiliations
Comparative Study

Comparative analysis of methods for evaluation of protein models against native structures

Kliment Olechnovič et al. Bioinformatics. .

Abstract

Motivation: Measuring discrepancies between protein models and native structures is at the heart of development of protein structure prediction methods and comparison of their performance. A number of different evaluation methods have been developed; however, their comprehensive and unbiased comparison has not been performed.

Results: We carried out a comparative analysis of several popular model assessment methods (RMSD, TM-score, GDT, QCS, CAD-score, LDDT, SphereGrinder and RPF) to reveal their relative strengths and weaknesses. The analysis, performed on a large and diverse model set derived in the course of three latest community-wide CASP experiments (CASP10-12), had two major directions. First, we looked at general differences between the scores by analyzing distribution, correspondence and correlation of their values as well as differences in selecting best models. Second, we examined the score differences taking into account various structural properties of models (stereochemistry, hydrogen bonds, packing of domains and chain fragments, missing residues, protein length and secondary structure). Our results provide a solid basis for an informed selection of the most appropriate score or combination of scores depending on the task at hand.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Empirical distribution of score values on the single-domain dataset. Horizontal axis indicates score values, vertical axis—frequency of the value occurrence. For RMSD/tRMSD, GDT and CAD-score only representative versions are shown [Cα RMSD/tRMSD, GDT-TS and all atom CAD-score (CAD-AA)]
Fig. 2.
Fig. 2.
Correspondence of scores. Scatter plots for ‘A’ and ‘B’ score pairs. Horizontal direction represents values of score ‘A’, vertical direction represents score ‘B’. Increasing color intensity represents the increasing local density of values
Fig. 3.
Fig. 3.
Correlation of scores. (A) Spearman’s rank correlation coefficients computed by averaging per target values. Coloring ranges from blue (high correlation) to red (low correlation). (B) Clustering of scores according to their correlations using multi-dimensional scaling (MDS) (Color version of this figure is available at Bioinformatics online.)
Fig. 4.
Fig. 4.
Score differences in selecting a better model out of two. (A) Fractions of model pairs, where the disagreement between scores exceeds the tolerance threshold. Differences are colored from blue (smallest) to red (largest). (B) Clustering of scores based on the analysis of model pairs with conflicting ranking (Color version of this figure is available at Bioinformatics online.)
Fig. 5.
Fig. 5.
Agreement between the scores in selecting the best model out of many. (A) Average fraction of the same selections. (B) Clustering of scores according to their agreement in selecting the best model
Fig. 6.
Fig. 6.
Score differences in selecting the best model out of many. (A) Average losses of Z-score computed by score B (vertical axis) when the best model is selected using score A (horizontal axis). Z-score losses are colored from blue (smallest) to red (largest). Note the overall asymmetry of Z-score loss for score pairs. (B) Clustering of scores according to average Z-score losses (Color version of this figure is available at Bioinformatics online.)
Fig. 7.
Fig. 7.
Conflicting rankings of model pairs ‘judged’ using MolProbity. Fractions of conflicting model pairs for which MolProbity supports score ‘A’ (vertical axis) over score ‘B’ (horizontal axis). The MolProbity support is colored from blue (largest) to red (smallest) (Color version of this figure is available at Bioinformatics online.)
Fig. 8.
Fig. 8.
Conflicting rankings of model pairs ‘judged’ by the number of reproduced hydrogen bonds. The scores are compared using differences larger than a defined threshold (A) for all hydrogen bonds and (B) for only non-local hydrogen bonds
Fig. 9.
Fig. 9.
Grishin plots reflecting sensitivity of different scores to the relative orientation of protein domains. Horizontal axis indicates the score values for the full structure, vertical axis indicates weighted sum of scores for individual domains
Fig. 10.
Fig. 10.
Different types of local structural deviations. CASP10 target T0663-D1 (blue) superimposed with three models: (A) TS301_5 (orange), (B) TS301_3 (yellow) and (C) TS476_4 (grey). Red arrow indicates the C-terminal region featuring different types of deviation in the models (Color version of this figure is available at Bioinformatics online.)
Fig. 11.
Fig. 11.
Response of the scores to model completeness. Horizontal axis indicates the model completeness as the percentage of residues modeled. Vertical axis indicates mean Z-score for models of the same degree of completeness. Z-scores are averaged using left-sided sliding window of 15%, e.g. values at 80% are averages of 95–80% range

References

    1. Altman N., Krzywinski M. (2015) Association, correlation and causation. Nat. Methods, 12, 899–900. - PubMed
    1. Chen V.B. et al. (2010) MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr., 66, 12–21. - PMC - PubMed
    1. Cong Q. et al. (2011) An automatic method for CASP9 free modeling structure prediction assessment. Bioinformatics, 27, 3371–3378. - PMC - PubMed
    1. Fisher R.A. (1915) Frequency distribution of the values of the correlation coefficient in samples of an indefinitely large population. Biometrika, 10, 507–521.
    1. Haas J. et al. (2018) Continuous Automated Model EvaluatiOn (CAMEO) complementing the critical assessment of structure prediction in CASP12. Proteins, 86, 387–398. - PMC - PubMed

Publication types