Protein function annotation from sequence: prediction of residues interacting with RNA
- PMID: 19389733
- DOI: 10.1093/bioinformatics/btp257
Protein function annotation from sequence: prediction of residues interacting with RNA
Abstract
Motivation: All eukaryotic proteomes are characterized by a significant percentage of proteins of unknown function. Comp-utational function prediction methods are therefore essential as initial steps in the function annotation process. This article describes an annotation method (PiRaNhA) for the prediction of RNA-binding residues (RBRs) from protein sequence information. A series of sequence properties (position specific scoring matrices, interface propensities, predicted accessibility and hydrophobicity) are used to train a support vector machine. This method is then evaluated for its potential to be applied to RNA-binding function prediction at the level of the complete protein.
Results: The 5-fold cross-validation of PiRaNhA on a dataset of 81 RNA-binding proteins achieves a Matthews Correlation Coefficient (MCC) of 0.50 and accuracy of 87.2%. When used to predict RBRs in 42 proteins not used in training, PiRaNhA achieves an MCC of 0.41 and accuracy of 84.5%. Decision values from the PiRaNhA predictions were used in a second SVM to make predictions of RNA-binding function at the protein level, achieving an MCC of 0.53 and accuracy of 76.1%. The PiRaNhA RBR predictions allow experimentalists to perform more targeted experiments for function annotation; and the prediction of RNA-binding function at the protein level shows promise for proteome-wide annotations.
Availability and implementation: Freely available on the web at www.bioinformatics.sussex.ac.uk/PIRANHA or http://piranha.protein.osaka-u.ac.jp.
Supplementary information: Supplementary data are available at the Bioinformatics online.
Similar articles
-
PiRaNhA: a server for the computational prediction of RNA-binding residues in protein sequences.Nucleic Acids Res. 2010 Jul;38(Web Server issue):W412-6. doi: 10.1093/nar/gkq474. Epub 2010 May 27. Nucleic Acids Res. 2010. PMID: 20507911 Free PMC article.
-
RNA-binding residues in sequence space: conservation and interaction patterns.Comput Biol Chem. 2009 Oct;33(5):397-403. doi: 10.1016/j.compbiolchem.2009.07.012. Epub 2009 Jul 28. Comput Biol Chem. 2009. PMID: 19700370
-
Prediction of protein-RNA binding sites by a random forest method with combined features.Bioinformatics. 2010 Jul 1;26(13):1616-22. doi: 10.1093/bioinformatics/btq253. Epub 2010 May 18. Bioinformatics. 2010. PMID: 20483814
-
Computational Prediction of RNA-Binding Proteins and Binding Sites.Int J Mol Sci. 2015 Nov 3;16(11):26303-17. doi: 10.3390/ijms161125952. Int J Mol Sci. 2015. PMID: 26540053 Free PMC article. Review.
-
An Experimental Approach to Genome Annotation: This report is based on a colloquium sponsored by the American Academy of Microbiology held July 19-20, 2004, in Washington, DC.Washington (DC): American Society for Microbiology; 2004. Washington (DC): American Society for Microbiology; 2004. PMID: 33001599 Free Books & Documents. Review.
Cited by
-
Prediction of RNA- and DNA-Binding Proteins Using Various Machine Learning Classifiers.Avicenna J Med Biotechnol. 2019 Jan-Mar;11(1):104-111. Avicenna J Med Biotechnol. 2019. PMID: 30800250 Free PMC article.
-
Improving the prediction of yeast protein function using weighted protein-protein interactions.Theor Biol Med Model. 2011 Apr 27;8:11. doi: 10.1186/1742-4682-8-11. Theor Biol Med Model. 2011. PMID: 21524280 Free PMC article.
-
Exploiting structural and topological information to improve prediction of RNA-protein binding sites.BMC Bioinformatics. 2009 Oct 18;10:341. doi: 10.1186/1471-2105-10-341. BMC Bioinformatics. 2009. PMID: 19835626 Free PMC article.
-
Prediction of RNA binding proteins comes of age from low resolution to high resolution.Mol Biosyst. 2013 Oct;9(10):2417-25. doi: 10.1039/c3mb70167k. Mol Biosyst. 2013. PMID: 23872922 Free PMC article. Review.
-
PRIP: A Protein-RNA Interface Predictor Based on Semantics of Sequences.Life (Basel). 2022 Feb 18;12(2):307. doi: 10.3390/life12020307. Life (Basel). 2022. PMID: 35207594 Free PMC article.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous