Predictors of natively unfolded proteins: unanimous consensus score to detect a twilight zone between order and disorder in generic datasets
- PMID: 20409339
- PMCID: PMC2877690
- DOI: 10.1186/1471-2105-11-198
Predictors of natively unfolded proteins: unanimous consensus score to detect a twilight zone between order and disorder in generic datasets
Abstract
Background: Natively unfolded proteins lack a well defined three dimensional structure but have important biological functions, suggesting a re-assignment of the structure-function paradigm. To assess that a given protein is natively unfolded requires laborious experimental investigations, then reliable sequence-only methods for predicting whether a sequence corresponds to a folded or to an unfolded protein are of interest in fundamental and applicative studies. Many proteins have amino acidic compositions compatible both with the folded and unfolded status, and belong to a twilight zone between order and disorder. This makes difficult a dichotomic classification of protein sequences into folded and natively unfolded ones. In this work we propose an operational method to identify proteins belonging to the twilight zone by combining into a consensus score good performing single predictors of folding.
Results: In this methodological paper dichotomic folding indexes are considered: hydrophobicity-charge, mean packing, mean pairwise energy, Poodle-W and a new global index, that is called here gVSL2, based on the local disorder predictor VSL2. The performance of these indexes is evaluated on different datasets, in particular on a new dataset composed by 2369 folded and 81 natively unfolded proteins. Poodle-W, gVSL2 and mean pairwise energy have good performance and stability in all the datasets considered and are combined into a strictly unanimous combination score SSU, that leaves proteins unclassified when the consensus of all combined indexes is not reached. The unclassified proteins: i) belong to an overlap region in the vector space of amino acidic compositions occupied by both folded and unfolded proteins; ii) are composed by approximately the same number of order-promoting and disorder-promoting amino acids; iii) have a mean flexibility intermediate between that of folded and that of unfolded proteins.
Conclusions: Our results show that proteins unclassified by SSU belong to a twilight zone. Proteins left unclassified by the consensus score SSU have physical properties intermediate between those of folded and those of natively unfolded proteins and their structural properties and evolutionary history are worth to be investigated.
Figures








Similar articles
-
[Prediction of natively unfolded regions in protein chain].Mol Biol (Mosk). 2006 Mar-Apr;40(2):341-8. Mol Biol (Mosk). 2006. PMID: 16637275 Russian.
-
Why are "natively unfolded" proteins unstructured under physiologic conditions?Proteins. 2000 Nov 15;41(3):415-27. doi: 10.1002/1097-0134(20001115)41:3<415::aid-prot130>3.0.co;2-7. Proteins. 2000. PMID: 11025552
-
To be folded or to be unfolded?Protein Sci. 2004 Nov;13(11):2871-7. doi: 10.1110/ps.04881304. Protein Sci. 2004. PMID: 15498936 Free PMC article.
-
Amyloidogenesis of natively unfolded proteins.Curr Alzheimer Res. 2008 Jun;5(3):260-87. doi: 10.2174/156720508784533312. Curr Alzheimer Res. 2008. PMID: 18537543 Free PMC article. Review.
-
Natively unfolded proteins.Curr Opin Struct Biol. 2005 Feb;15(1):35-41. doi: 10.1016/j.sbi.2005.01.002. Curr Opin Struct Biol. 2005. PMID: 15718131 Review.
Cited by
-
Relationships between kinetic constants and the amino acid composition of enzymes from the yeast Saccharomyces cerevisiae glycolysis pathway.EURASIP J Bioinform Syst Biol. 2012 Aug 6;2012(1):11. doi: 10.1186/1687-4153-2012-11. EURASIP J Bioinform Syst Biol. 2012. PMID: 22867018 Free PMC article.
-
Evolutionary coupling analysis guides identification of mistrafficking-sensitive variants in cardiac K+ channels: Validation with hERG.Front Pharmacol. 2022 Oct 20;13:1010119. doi: 10.3389/fphar.2022.1010119. eCollection 2022. Front Pharmacol. 2022. PMID: 36339618 Free PMC article.
-
Relationship between Metabolic Fluxes and Sequence-Derived Properties of Enzymes.Int Sch Res Notices. 2014 Oct 29;2014:817102. doi: 10.1155/2014/817102. eCollection 2014. Int Sch Res Notices. 2014. PMID: 27437461 Free PMC article.
References
-
- Dunker A, Lawson J, Brown C, Romero P, Oh J, Oldfield C, Campen A, Ratliffl C, Hipps K, Ausio J, Nissen M, Reeves R, Kang C, Kissinger C, Bailey R, Griswold M, Chin W, Garner E, Obradovic Z. Intrinsically disordered proteins. J Mol Graph Model. 2001;19:26–59. doi: 10.1016/S1093-3263(00)00138-8. - DOI - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources