Building a better fragment library for de novo protein structure prediction
- PMID: 25901595
- PMCID: PMC4406757
- DOI: 10.1371/journal.pone.0123998
Building a better fragment library for de novo protein structure prediction
Abstract
Fragment-based approaches are the current standard for de novo protein structure prediction. These approaches rely on accurate and reliable fragment libraries to generate good structural models. In this work, we describe a novel method for structure fragment library generation and its application in fragment-based de novo protein structure prediction. The importance of correct testing procedures in assessing the quality of fragment libraries is demonstrated. In particular, the exclusion of homologs to the target from the libraries to correctly simulate a de novo protein structure prediction scenario, something which surprisingly is not always done. We demonstrate that fragments presenting different predominant predicted secondary structures should be treated differently during the fragment library generation step and that exhaustive and random search strategies should both be used. This information was used to develop a novel method, Flib. On a validation set of 41 structurally diverse proteins, Flib libraries presents both a higher precision and coverage than two of the state-of-the-art methods, NNMake and HHFrag. Flib also achieves better precision and coverage on the set of 275 protein domains used in the two previous experiments of the the Critical Assessment of Structure Prediction (CASP9 and CASP10). We compared Flib libraries against NNMake libraries in a structure prediction context. Of the 13 cases in which a correct answer was generated, Flib models were more accurate than NNMake models for 10. "Flib is available for download at: http://www.stats.ox.ac.uk/research/proteins/resources".
Conflict of interest statement
Figures








Similar articles
-
Combining co-evolution and secondary structure prediction to improve fragment library generation.Bioinformatics. 2018 Jul 1;34(13):2219-2227. doi: 10.1093/bioinformatics/bty084. Bioinformatics. 2018. PMID: 29462243
-
LRFragLib: an effective algorithm to identify fragments for de novo protein structure prediction.Bioinformatics. 2017 Mar 1;33(5):677-684. doi: 10.1093/bioinformatics/btw668. Bioinformatics. 2017. PMID: 27797773
-
Construct a variable-length fragment library for de novo protein structure prediction.Brief Bioinform. 2022 May 13;23(3):bbac086. doi: 10.1093/bib/bbac086. Brief Bioinform. 2022. PMID: 35284936
-
Computational approaches for protein function prediction: a combined strategy from multiple sequence alignment to molecular docking-based virtual screening.Biochim Biophys Acta. 2010 Sep;1804(9):1695-712. doi: 10.1016/j.bbapap.2010.04.008. Epub 2010 Apr 28. Biochim Biophys Acta. 2010. PMID: 20433957 Review.
-
Protein structure prediction: challenging targets for CASP10.J Biomol Struct Dyn. 2012;30(5):607-15. doi: 10.1080/07391102.2012.687526. Epub 2012 Jun 26. J Biomol Struct Dyn. 2012. PMID: 22731875 Review.
Cited by
-
Computational Methods for the Elucidation of Protein Structure and Interactions.Methods Mol Biol. 2021;2305:23-52. doi: 10.1007/978-1-0716-1406-8_2. Methods Mol Biol. 2021. PMID: 33950383 Review.
-
Enhancing fragment-based protein structure prediction by customising fragment cardinality according to local secondary structure.BMC Bioinformatics. 2020 May 1;21(1):170. doi: 10.1186/s12859-020-3491-0. BMC Bioinformatics. 2020. PMID: 32357827 Free PMC article.
-
Improved fragment-based protein structure prediction by redesign of search heuristics.Sci Rep. 2018 Sep 12;8(1):13694. doi: 10.1038/s41598-018-31891-8. Sci Rep. 2018. PMID: 30209258 Free PMC article.
-
Toward a detailed understanding of search trajectories in fragment assembly approaches to protein structure prediction.Proteins. 2016 Apr;84(4):411-26. doi: 10.1002/prot.24987. Epub 2016 Feb 23. Proteins. 2016. PMID: 26799916 Free PMC article.
-
Assigning secondary structure in proteins using AI.J Mol Model. 2021 Aug 17;27(9):252. doi: 10.1007/s00894-021-04825-x. J Mol Model. 2021. PMID: 34402969
References
-
- Bradley P, Misura KM, Baker D. Toward high-resolution de novo structure prediction for small proteins. Science 309, 1868–71. (2005) - PubMed
-
- Bonneau R, Strauss CE, Rohl CA, Chivian D, Bradley P, Malmstrom L et al. De novo prediction of three-dimensional structures for major protein families. J Mol Biol 322(1):65–78 (2002) - PubMed
-
- Bonneau R, Tsai J, Ruczinski I, Chivian D, Rohl C, Strauss CE et al. Rosetta in CASP4: progress in ab initio protein structure prediction. Proteins Suppl 5:119–26 (2001) - PubMed
Publication types
MeSH terms
Substances
Associated data
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous