Integrating protein-protein interactions and text mining for protein function prediction
- PMID: 18673526
- PMCID: PMC2500093
- DOI: 10.1186/1471-2105-9-S8-S2
Abstract
Background: Functional annotation of proteins remains a challenging task. The scientific literature is currently the main source of functional annotations that have not yet been curated, but curation work is slow and expensive, and the automatic techniques that support it still lack reliability. We developed a method to identify conserved protein interaction graphs and to predict missing protein functions from orthologs in these graphs. To enhance the precision of the results, we also implemented a procedure that validates all predictions against findings reported in the literature.
Results: Using this procedure, more than 80% of the GO annotations available in UniProtKB/Swiss-Prot for proteins with highly conserved orthologs could be verified automatically. For a subset of proteins we predicted new GO annotations that were not available in UniProtKB/Swiss-Prot. All of these predictions were correct (100% precision) according to verification by a trained curator.
Conclusion: Our method of integrating conserved and connected subgraphs (CCSs) of protein interaction networks with literature mining is thus a highly reliable approach to predicting GO annotations for weakly characterized proteins that have orthologs.
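The two steps named in the abstract, transferring annotations between orthologs and then keeping only predictions supported by the literature, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the toy protein identifiers, and the representation of literature evidence as a set of (protein, GO term) pairs are all assumptions made for illustration, and the detection of conserved interaction graphs is taken as already done.

```python
from typing import Dict, Iterable, Set, Tuple

Prediction = Tuple[str, str]  # (protein id, GO term id)


def transfer_annotations(
    ortholog_pairs: Iterable[Tuple[str, str]],
    annotations: Dict[str, Set[str]],
) -> Set[Prediction]:
    """Propose, for each orthologous pair, the GO terms one protein has
    and its ortholog lacks (hypothetical helper, not from the paper)."""
    predictions: Set[Prediction] = set()
    for a, b in ortholog_pairs:
        terms_a = annotations.get(a, set())
        terms_b = annotations.get(b, set())
        predictions |= {(b, term) for term in terms_a - terms_b}
        predictions |= {(a, term) for term in terms_b - terms_a}
    return predictions


def validate_with_literature(
    predictions: Set[Prediction],
    literature_hits: Set[Prediction],
) -> Set[Prediction]:
    """Keep only predictions that also appear in the literature evidence.
    Assumption: text mining has already produced (protein, GO term) pairs."""
    return predictions & literature_hits


# Toy data (invented identifiers; pairs assumed to come from a conserved graph).
annotations = {
    "human_P": {"GO:0005524", "GO:0004672"},
    "mouse_P": {"GO:0005524"},
}
ortholog_pairs = [("human_P", "mouse_P")]
literature_hits = {("mouse_P", "GO:0004672")}

predictions = transfer_annotations(ortholog_pairs, annotations)
validated = validate_with_literature(predictions, literature_hits)
print(sorted(validated))
```

Filtering by intersection trades recall for precision: a prediction without literature support is dropped even if it is correct, which matches the abstract's stated goal of enhancing precision.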