Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Apr 4:6:23990.
doi: 10.1038/srep23990.

Protein single-model quality assessment by feature-based probability density functions

Affiliations

Protein single-model quality assessment by feature-based probability density functions

Renzhi Cao et al. Sci Rep. .

Abstract

Protein quality assessment (QA) has played an important role in protein structure prediction. We developed a novel single-model quality assessment method-Qprob. Qprob calculates the absolute error for each protein feature value against the true quality scores (i.e. GDT-TS scores) of protein structural models, and uses them to estimate its probability density distribution for quality assessment. Qprob has been blindly tested on the 11th Critical Assessment of Techniques for Protein Structure Prediction (CASP11) as MULTICOM-NOVEL server. The official CASP result shows that Qprob ranks as one of the top single-model QA methods. In addition, Qprob makes contributions to our protein tertiary structure predictor MULTICOM, which is officially ranked 3rd out of 143 predictors. The good performance shows that Qprob is good at assessing the quality of models of hard targets. These results demonstrate that this new probability density distribution based method is effective for protein single-model quality assessment and is useful for protein structure prediction. The webserver of Qprob is available at: http://calla.rnet.missouri.edu/qprob/. The software is now freely available in the web server of Qprob.

PubMed Disclaimer

Figures

Figure 1
Figure 1. The relationship of three energy scores (DFIRE2, RWplus, and RF_CB_SRS_OD scores) and sequence length on PISCES database.
Figure 2
Figure 2. The probability density distributions for the error estimation of all 11 feature scores.
Figure 3
Figure 3. The summation of Z-score for the top 1 models selected by each method.

References

    1. Li J., Cao R. & Cheng J. A large-scale conformation sampling and evaluation server for protein tertiary structure prediction and its assessment in CASP11. BMC bioinformatics 16, 337 (2015). - PMC - PubMed
    1. Cao R., Bhattacharya D., Adhikari B., Li J. & Cheng J. Large-scale model quality assessment for improving protein tertiary structure prediction. Bioinformatics 31, i116–i123 (2015). - PMC - PubMed
    1. Wang Z., Cao R. & Cheng J. Three-level prediction of protein function by combining profile-sequence search, profile-profile search, and domain co-occurrence networks. BMC bioinformatics 14, S3 (2013). - PMC - PubMed
    1. Cao R., Bhattacharya D., Adhikari B., Li J. & Cheng J. Massive integration of diverse protein quality assessment methods to improve template based modeling in CASP11. Proteins: Structure, Function, and Bioinformatics, doi: 10.1002/prot.24924 (2015). - DOI - PMC - PubMed
    1. Cao R. & Cheng J. Integrated protein function prediction by mining function associations, sequences, and protein-protein and gene-gene interaction networks. Methods 93, 84–91 (2016). - PMC - PubMed

Publication types