Enhanced protein fold recognition using a structural alphabet
- PMID: 19089985
- DOI: 10.1002/prot.22324
Abstract
Fold recognition from sequence can be an important step in protein structure and function prediction. Many methods have tackled this goal. Most of them are based on sequence alignment and fail for sequences of low similarity. Alignment-free approaches can provide an efficient alternative. For such approaches, identifying features that discriminate efficiently between folds is critical. We propose a new fold recognition approach that relies on encoding the local structure of proteins using a Hidden Markov Model Structural Alphabet. This encoding provides a 1D description of the conformation of complete protein structures, including loops. At the fold level, compared with the classical secondary structure states (helix, strand, and coil), such an encoding is expected to discriminate better between loop conformations and hence to provide better fold identification. Compared with previous related approaches, this additional information results in a significant improvement. When combining it with supplementary information on secondary structure and residue burial, we obtain a fold recognition accuracy of 78% for 27 protein families, that is, 8% higher than the best available method so far, and of 68% for 60 families. The corresponding scores at the class level are 92% and 90%, indicating that mispredictions occur mostly within structural classes.
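To make the idea of an alignment-free, 1D structural encoding more concrete, the following minimal sketch turns a structural-alphabet string into a fixed-length vector of k-mer frequencies. It is an illustration only and not the authors' method: the four-letter alphabet, the function name `composition_features`, and the choice of k-mer frequencies as features are all assumptions made for the example.

```python
from collections import Counter
from itertools import product


def composition_features(sa_string, alphabet, k=2):
    """Return normalized k-mer frequencies of a structural-alphabet string.

    Hypothetical helper: the abstract does not specify which features
    are derived from the encoding, so k-mer composition is an assumption.
    """
    # Enumerate every possible k-mer in a fixed order so that all
    # proteins map to vectors of the same length.
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    # Count overlapping k-mers along the encoded protein.
    counts = Counter(sa_string[i:i + k] for i in range(len(sa_string) - k + 1))
    # Normalize by the number of counted k-mers (guard against empty input).
    total = max(sum(counts[m] for m in kmers), 1)
    return [counts[m] / total for m in kmers]


# Toy alphabet and toy encoded protein (both hypothetical).
alphabet = "ABCD"
features = composition_features("AABBCCDDAB", alphabet, k=2)
```

Because the vector length depends only on the alphabet size and on k, proteins of different lengths become directly comparable without any alignment, and such vectors could be passed to any standard classifier.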