Prediction of unfolded segments in a protein sequence based on amino acid composition
- PMID: 15657106
- DOI: 10.1093/bioinformatics/bti266
Prediction of unfolded segments in a protein sequence based on amino acid composition
Abstract
Motivation: Partially and wholly unstructured proteins have now been identified in all kingdoms of life--more commonly in eukaryotic organisms. This intrinsic disorder is related to certain critical functions. Apart from their fundamental interest, unstructured regions in proteins may prevent crystallization. Therefore, the prediction of disordered regions is an important aspect for the understanding of protein function, but may also help to devise genetic constructs.
Results: In this paper we present a computational tool for the detection of unstructured regions in proteins based on two properties of unfolded fragments: (1) disordered regions have a biased composition and (2) they usually contain either small or no hydrophobic clusters. In order to quantify these two facts we first calculate the amino acid distributions in structured and unstructured regions. Using this distribution, we calculate for a given sequence fragment the probability to be part of either a structured or an unstructured region. For each amino acid, the distance to the nearest hydrophobic cluster is also computed. Using these three values along a protein sequence allows us to predict unstructured regions, with very simple rules. This method requires only the primary sequence, and no multiple alignment, which makes it an adequate method for orphan proteins.
Availability: http://genomics.eu.org/
Similar articles
-
IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content.Bioinformatics. 2005 Aug 15;21(16):3433-4. doi: 10.1093/bioinformatics/bti541. Epub 2005 Jun 14. Bioinformatics. 2005. PMID: 15955779
-
FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded.Bioinformatics. 2005 Aug 15;21(16):3435-8. doi: 10.1093/bioinformatics/bti537. Epub 2005 Jun 14. Bioinformatics. 2005. PMID: 15955783
-
POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions.Bioinformatics. 2007 Aug 15;23(16):2046-53. doi: 10.1093/bioinformatics/btm302. Epub 2007 Jun 1. Bioinformatics. 2007. PMID: 17545177
-
State-of-the-art bioinformatics protein structure prediction tools (Review).Int J Mol Med. 2011 Sep;28(3):295-310. doi: 10.3892/ijmm.2011.705. Epub 2011 May 23. Int J Mol Med. 2011. PMID: 21617841 Review.
-
Bioinformatical approaches to characterize intrinsically disordered/unstructured proteins.Brief Bioinform. 2010 Mar;11(2):225-43. doi: 10.1093/bib/bbp061. Epub 2009 Dec 10. Brief Bioinform. 2010. PMID: 20007729 Review.
Cited by
-
An interrupted beta-propeller and protein disorder: structural bioinformatics insights into the N-terminus of alsin.J Mol Model. 2009 Feb;15(2):113-22. doi: 10.1007/s00894-008-0381-1. Epub 2008 Nov 21. J Mol Model. 2009. PMID: 19023603
-
Fungal CSL transcription factors.BMC Genomics. 2007 Jul 13;8:233. doi: 10.1186/1471-2164-8-233. BMC Genomics. 2007. PMID: 17629904 Free PMC article.
-
The translation regulatory subunit eIF3f controls the kinase-dependent mTOR signaling required for muscle differentiation and hypertrophy in mouse.PLoS One. 2010 Feb 1;5(2):e8994. doi: 10.1371/journal.pone.0008994. PLoS One. 2010. PMID: 20126553 Free PMC article.
-
Structural disorder within Henipavirus nucleoprotein and phosphoprotein: from predictions to experimental assessment.PLoS One. 2010 Jul 21;5(7):e11684. doi: 10.1371/journal.pone.0011684. PLoS One. 2010. PMID: 20657787 Free PMC article.
-
Spritz: a server for the prediction of intrinsically disordered regions in protein sequences using kernel machines.Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W164-8. doi: 10.1093/nar/gkl166. Nucleic Acids Res. 2006. PMID: 16844983 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases