Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Jul 22;9 Suppl 8(Suppl 8):S2.
doi: 10.1186/1471-2105-9-S8-S2.

Integrating protein-protein interactions and text mining for protein function prediction

Affiliations

Integrating protein-protein interactions and text mining for protein function prediction

Samira Jaeger et al. BMC Bioinformatics. .

Abstract

Background: Functional annotation of proteins remains a challenging task. Currently the scientific literature serves as the main source for yet uncurated functional annotations, but curation work is slow and expensive. Automatic techniques that support this work are still lacking reliability. We developed a method to identify conserved protein interaction graphs and to predict missing protein functions from orthologs in these graphs. To enhance the precision of the results, we furthermore implemented a procedure that validates all predictions based on findings reported in the literature.

Results: Using this procedure, more than 80% of the GO annotations for proteins with highly conserved orthologs that are available in UniProtKb/Swiss-Prot could be verified automatically. For a subset of proteins we predicted new GO annotations that were not available in UniProtKb/Swiss-Prot. All predictions were correct (100% precision) according to the verifications from a trained curator.

Conclusion: Our method of integrating CCSs and literature mining is thus a highly reliable approach to predict GO annotations for weakly characterized proteins with orthologs.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example CCS detected between H. sapiens and S. cerevisiae. The figure shows a conserved and connected subgraph between H. sapiens (circle) and S. cerevisiae (hexagon). Proteins of both species are involved in mRNA splicing and are known to exhibit splicing factor activity to bind the mRNA and support the splicing process. (Solid lines represent conserved PPIs within a species and dashed lines indicate orthology relationships between proteins.)
Figure 2
Figure 2
Schematic illustration for comparing GO annotations in this study. The flowchart summarizes all four combinations for comparing GO annotations of different resources. Protein – annotation associations were extracted from text with or without species identification and GO annotations from text and the UniProtKb/Swiss-Prot database were compared based on exact versus relative matching.
Figure 3
Figure 3
Schematic overview of the studied protein annotations. Schematic overview of the different sets of protein – annotation associations considered in this study.

Similar articles

Cited by

References

    1. Consortium GO. The Gene Ontology (GO) project in 2006. Nucleic Acids Res. 2006;34:D322–D326. doi: 10.1093/nar/gkj021. - DOI - PMC - PubMed
    1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. - DOI - PMC - PubMed
    1. Couto FM, Silva MJ, Lee V, Dimmer E, Camon E, Apweiler R, Kirsch H, Rebholz-Schuhmann D. GOAnnotator: linking protein GO annotations to evidence text. J Biomed Discov Collab. 2006;1:19. doi: 10.1186/1747-5333-1-19. - DOI - PMC - PubMed
    1. Pandey G, Kumar V, Steinbach M. Tech Rep TR-06-028. Department of Computer Science and Engineering, University of Minnesota; 2006. Computational Approaches for Protein Function Prediction: A survey.
    1. Baxter SM, Fetrow JS. Sequence- and structure-based protein function prediction from genomic information. Curr Opin Drug Discov Devel. 2001;4:291–295. - PubMed

Publication types

LinkOut - more resources