A hidden Markov model derived structural alphabet for proteins
- PMID: 15147844
- DOI: 10.1016/j.jmb.2004.04.005
Abstract
Understanding and predicting protein structures depends on the complexity and accuracy of the models used to represent them. We have set up a hidden Markov model that discretizes protein backbone conformation as a series of overlapping fragments (states), each four residues long. This approach learns the geometry of the states and their connections simultaneously. Using a statistical criterion, we obtain an optimal systematic decomposition of the conformational variability of the polypeptide chain into 27 states with strong connection logic. This result is stable across different protein sets. Our model agrees well with previous knowledge of protein architecture and appears able to capture some subtle details of protein organisation, such as sub-level organisation schemes within helices. Taking the dependence between states into account yields a low-complexity description of local protein structure: on average, the model uses only 8.3 of the 27 states to describe each position of a protein structure. Although we use short fragments, learning on entire protein conformations captures the logic of assembly on a larger scale. Using such a model, protein structures can be reconstructed with an average accuracy close to 1.1 Å root-mean-square deviation, at a complexity of only 3. Finally, we observe that sequence specificity increases with the number of states in the structural alphabet. Such models can be a highly relevant approach to the analysis of protein architecture, in particular for protein structure prediction.
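The idea of encoding a backbone as a string of overlapping four-residue states can be illustrated with a minimal Python sketch. It is not the authors' implementation: the three-distance descriptor, the two toy states, their mean values, the shared variance and the uniform transition matrix are all assumptions chosen for illustration, whereas the model in the abstract has 27 states whose geometry and transitions are learned from data. The sketch slides a window of four alpha-carbons along a chain, describes each window by three inter-residue distances, and assigns the most likely state path with the standard Viterbi algorithm.

```python
import numpy as np

def fragment_descriptors(ca):
    """Describe each overlapping 4-residue window by three CA-CA distances
    (an assumed, simplified descriptor: d(1,3), d(1,4), d(2,4))."""
    ca = np.asarray(ca, dtype=float)
    n = len(ca) - 3                      # number of overlapping fragments
    d13 = np.linalg.norm(ca[0:n] - ca[2:n + 2], axis=1)
    d14 = np.linalg.norm(ca[0:n] - ca[3:n + 3], axis=1)
    d24 = np.linalg.norm(ca[1:n + 1] - ca[3:n + 3], axis=1)
    return np.stack([d13, d14, d24], axis=1)

def viterbi(obs, log_start, log_trans, means, var):
    """Most likely state path for an HMM with isotropic Gaussian emissions
    sharing one variance (the common normalising constant is dropped)."""
    diff = obs[:, None, :] - means[None, :, :]
    log_emit = -0.5 * np.sum(diff ** 2, axis=2) / var
    T, K = log_emit.shape
    score = log_start + log_emit[0]
    back = np.zeros((T, K), dtype=int)
    for t in range(1, T):
        cand = score[:, None] + log_trans        # cand[i, j]: state i -> j
        back[t] = np.argmax(cand, axis=0)        # best predecessor of each j
        score = cand[back[t], np.arange(K)] + log_emit[t]
    path = [int(np.argmax(score))]
    for t in range(T - 1, 0, -1):                # trace back from the end
        path.append(int(back[t, path[-1]]))
    return path[::-1]

# Toy input: seven alpha-carbons on a straight line, 3.8 angstroms apart.
ca_trace = [[3.8 * i, 0.0, 0.0] for i in range(7)]
desc = fragment_descriptors(ca_trace)

# Two hypothetical states (illustrative values in angstroms, not from the paper):
# state 0 is compact and helix-like, state 1 is extended.
means = np.array([[5.4, 5.0, 5.4],
                  [6.5, 10.0, 6.5]])
log_start = np.log(np.array([0.5, 0.5]))
log_trans = np.log(np.full((2, 2), 0.5))         # uniform, i.e. no connection logic

path = viterbi(desc, log_start, log_trans, means, var=1.0)
print(path)
```

A chain of N residues yields N - 3 overlapping fragments, so the seven-residue toy chain is encoded as four state labels. With a learned, non-uniform transition matrix, the same decoding would also exploit the dependence between successive states that the abstract credits for the low complexity of the description.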