Inferring function using patterns of native disorder in proteins
- PMID: 17722973
- PMCID: PMC1950950
- DOI: 10.1371/journal.pcbi.0030162
Inferring function using patterns of native disorder in proteins
Abstract
Natively unstructured regions are a common feature of eukaryotic proteomes. Between 30% and 60% of proteins are predicted to contain long stretches of disordered residues, and not only have many of these regions been confirmed experimentally, but they have also been found to be essential for protein function. In this study, we directly address the potential contribution of protein disorder in predicting protein function using standard Gene Ontology (GO) categories. Initially we analyse the occurrence of protein disorder in the human proteome and report ontology categories that are enriched in disordered proteins. Pattern analysis of the distributions of disordered regions in human sequences demonstrated that the functions of intrinsically disordered proteins are both length- and position-dependent. These dependencies were then encoded in feature vectors to quantify the contribution of disorder in human protein function prediction using Support Vector Machine classifiers. The prediction accuracies of 26 GO categories relating to signalling and molecular recognition are improved using the disorder features. The most significant improvements were observed for kinase, phosphorylation, growth factor, and helicase categories. Furthermore, we provide predicted GO term assignments using these classifiers for a set of unannotated and orphan human proteins. In this study, the importance of capturing protein disorder information and its value in function prediction is demonstrated. The GO category classifiers generated can be used to provide more reliable predictions and further insights into the behaviour of orphan and unannotated proteins.
Conflict of interest statement
Figures







Similar articles
-
POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions.Bioinformatics. 2007 Aug 15;23(16):2046-53. doi: 10.1093/bioinformatics/btm302. Epub 2007 Jun 1. Bioinformatics. 2007. PMID: 17545177
-
Natively disordered proteins: functions and predictions.Appl Bioinformatics. 2004;3(2-3):105-13. doi: 10.2165/00822942-200403020-00005. Appl Bioinformatics. 2004. PMID: 15693736 Review.
-
Identification of Intrinsically Unstructured Proteins using hierarchical classifier.Int J Data Min Bioinform. 2008;2(2):121-33. doi: 10.1504/ijdmb.2008.019093. Int J Data Min Bioinform. 2008. PMID: 18767350
-
RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins.Bioinformatics. 2005 Aug 15;21(16):3369-76. doi: 10.1093/bioinformatics/bti534. Epub 2005 Jun 9. Bioinformatics. 2005. PMID: 15947016
-
Natively unfolded proteins.Curr Opin Struct Biol. 2005 Feb;15(1):35-41. doi: 10.1016/j.sbi.2005.01.002. Curr Opin Struct Biol. 2005. PMID: 15718131 Review.
Cited by
-
High-throughput prediction of RNA, DNA and protein binding regions mediated by intrinsic disorder.Nucleic Acids Res. 2015 Oct 15;43(18):e121. doi: 10.1093/nar/gkv585. Epub 2015 Jun 24. Nucleic Acids Res. 2015. PMID: 26109352 Free PMC article.
-
Protein Folding and Mechanisms of Proteostasis.Int J Mol Sci. 2015 Jul 28;16(8):17193-230. doi: 10.3390/ijms160817193. Int J Mol Sci. 2015. PMID: 26225966 Free PMC article. Review.
-
Predicting Protein Conformational Disorder and Disordered Binding Sites.Methods Mol Biol. 2022;2449:95-147. doi: 10.1007/978-1-0716-2095-3_4. Methods Mol Biol. 2022. PMID: 35507260
-
Expanding the repertoire of human tandem repeat RNA-binding proteins.PLoS One. 2023 Sep 20;18(9):e0290890. doi: 10.1371/journal.pone.0290890. eCollection 2023. PLoS One. 2023. PMID: 37729217 Free PMC article.
-
MeDor: a metaserver for predicting protein disorder.BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S25. doi: 10.1186/1471-2164-9-S2-S25. BMC Genomics. 2008. PMID: 18831791 Free PMC article.
References
-
- Friedberg I. Automated protein function prediction—The genomic challenge. Brief Bioinform. 2006;7:225–242. - PubMed
-
- Ofran Y, Punta M, Schneider R, Rost B. Beyond annotation transfer by homology novel protein-function prediction methods to assist drug discovery. Drug Discov Today. 2005;10:1475–1482. - PubMed
-
- Jensen LJ, Gupta R, Staerfeldt HH, Brunak S. Prediction of human protein function according to Gene Ontology categories. Bioinformatics. 2003;19:635–642. - PubMed
-
- Jensen LJ, Gupta R, Blom N, Devos D, Tamames J, et al. Prediction of human protein function from post-translational modifications and localization features. J Mol Biol. 2002;319:1257–1265. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources