Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Jul 9:2013:bat054.
doi: 10.1093/database/bat054. Print 2013.

A guide to best practices for Gene Ontology (GO) manual annotation

Affiliations

A guide to best practices for Gene Ontology (GO) manual annotation

Rama Balakrishnan et al. Database (Oxford). .

Abstract

The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374,000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all. DATABASE URL: http://www.geneontology.org.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
GO Term ‘leukotriene-A4 hydrolase activity’ [GO:0004463], one of the terms mentioned in the main text of the article, as seen in AmiGO (16, http://amigo.geneontology.org). (a) Graphical view of the ontology structure showing the most granular term ‘leukotriene-A4 hydrolase activity’ [GO:0004463] at the bottom (highlighted in red), and all its parent terms leading up to the root node (‘molecular_function’ [GO:0003674]) at the top. Each box representing a GO term includes the GO identifier, and the blue line connecting the terms represent the ontological relationship ‘is_a’ (implying that a child term is a subtype of the parent term). (b) Alternate text display for viewing the ontology structure. ‘leukotriene-A4 hydrolase activity’ [GO:0004463] is highlighted in red. Each child term is indented from its parent to indicate the depth of the tree. Apart from the GOID and GO term, each row includes other pieces of information that are important to understand the ontology and the annotations to each term. Starting from the left end of the row, the + sign indicates that there are child terms for that node and clicking on the + sign opens the browser to display the child terms. Next the small icon ‘i’ indicates the term is related to its parent by an is–a relationship (explained above). At the right end of the row in brackets is the total number of gene products annotated to that term and all its child terms. (c) Term information relevant to making an annotation is highlighted in red, which includes the GOID, Aspect of the ontology (Molecular Function), Synonyms and Definition of the term.
Figure 2.
Figure 2.
GO Evidence code decision tree describing the process of choosing an evidence code. This flow chart is meant to orient the biocurator on the different categories of evidence codes and does not include the complete definitions of the evidence codes (Table 2). This chart will aid the biocurator to evaluate the reported method or results and map them to an appropriate evidence code; the biocurator should consult the detailed evidence code documentation available online from http://www.geneontology.org/GO.evidence.shtml.

References

    1. The Gene Ontology Consortium. Gene Ontology Annotations and Resources. Nucleic Acids Res. 2013;41:D530–D535. - PMC - PubMed
    1. The Gene Ontology Consortium. Gene ontology: tool for the unification of biology. Nat. Genet. 2000;25:25–29. - PMC - PubMed
    1. The Gene Ontology Consortium. The Gene Ontology in 2010: extensions and refinements. Nucleic Acids Res. 2010;38:D331–D335. - PMC - PubMed
    1. Barrell D, Dimmer E, Huntley R, et al. The GOA database in 2009–an integrated Gene Ontology Annotation resource. Nucleic Acids Res. 2009;37:D396–D403. - PMC - PubMed
    1. Speidel D, Bruederle CE, Enk C, et al. CAPS1 regulates catecholamine loading of large dense-core vesicles. Neuron. 2005;46:75–88. - PubMed

Publication types