Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Jul 1;33(Web Server issue):W783-6.
doi: 10.1093/nar/gki470.

GoPubMed: exploring PubMed with the Gene Ontology

Affiliations

GoPubMed: exploring PubMed with the Gene Ontology

Andreas Doms et al. Nucleic Acids Res. .

Abstract

The biomedical literature grows at a tremendous rate and PubMed comprises already over 15 000 000 abstracts. Finding relevant literature is an important and difficult problem. We introduce GoPubMed, a web server which allows users to explore PubMed search results with the Gene Ontology (GO), a hierarchically structured vocabulary for molecular biology. GoPubMed provides the following benefits: first, it gives an overview of the literature abstracts by categorizing abstracts according to the GO and thus allowing users to quickly navigate through the abstracts by category. Second, it automatically shows general ontology terms related to the original query, which often do not even appear directly in the abstract. Third, it enables users to verify its classification because GO terms are highlighted in the abstracts and as each term is labelled with an accuracy percentage. Fourth, exploring PubMed abstracts with GoPubMed is useful as it shows definitions of GO terms without the need for further look up. GoPubMed is online at www.gopubmed.org. Querying is currently limited to 100 papers per query.

PubMed Disclaimer

Figures

Figure 1
Figure 1
User interface of GoPubMed. The screen-shot of GoPubMed displays the results for the query ‘levamisole inhibitor’ limited to 100 papers. On the left, part of the GO relevant to the query is shown and on the right the abstracts for a selected GO term. The search terms are highlighted online in orange and the GO terms in green. Right of each abstract is a list with all the GO terms for that abstract ordered by an accuracy percentage. For example, the term ‘P-glycoprotein’, which is a synonym for the GO term ‘xenobiotec transporting ATPase’, is found with 100% accuracy, while ‘lung development’ matches only with 72%, as only the word ‘lung’ occurs in the abstract. Synonyms, such as the term ‘P-glycoprotein’ above, are displayed in dark grey and the synonymous term is given in a tool-tip (please note that Mozilla-based browsers do not currently break lines in tool-tips). Moving the mouse over the term displays the definition of the term in a tool-tip. The ontology on the left shows the paths from the root of the ontology—‘cellular component’, ‘biological process’ and ‘molecular function’—to the currently selected GO term. The number in brackets behind each GO term in the ontology is the number of papers the GO term or any of its children occur in. In the figure, the path from ‘molecular function’ to ‘alkaline phosphatase’ is shown and the number 71 behind the term ‘alkaline phosphatase’ indicates that there are 71 papers mentioning alkaline phosphatase. Clicking on the term displays the relevant abstracts, which confirm that levamisole inhibits alkaline phosphatase. Overall, the number of papers containing a term and its children is a very good indicator to let users select the most frequent terms and, thus, best representatives. Instead of using the ontology to browse through abstracts, users can also display all the abstracts in the same order as in PubMed with the additional benefit of displaying the GO terms and search keywords. Users can also search within the ontology using the input field at the bottom of the ontology. GoPubMed searches are currently limited to 100 papers per query. Answering a query takes ∼20 s.
Figure 2
Figure 2
GoPubMed displaying the results for ‘levamisole inhibitor’ that contain the GO term phosphofructokinase.

References

    1. Gene Ontology Consortium. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004;1:D258–D261. - PMC - PubMed
    1. Delfs R., Doms A., Kozlenkov A., Schroeder M. GoPubMed: ontology-based literature search applied to GeneOntology and PubMed. Proceedings of German Bioinformatics Conference; Bielefeld, Germany: LNBI Springer; 2004. pp. 169–178.
    1. Müller H.-M., Kenny E.E., Sternberg P.W. Textpresso: an ontology-based information retrieval and extraction system for biological literature. PLoS Biol. 2003;2:2003. - PMC - PubMed
    1. Perez-Iratxeta C., Perez A.J., Bork P., Andrade M.A. Update on XplorMed: a web server for exploring scientific literature. Nucleic Acids Res. 2003;31:3866–3868. - PMC - PubMed

Publication types