Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017:1446:25-37.
doi: 10.1007/978-1-4939-3743-1_3.

Primer on the Gene Ontology

Affiliations

Primer on the Gene Ontology

Pascale Gaudet et al. Methods Mol Biol. 2017.

Abstract

The Gene Ontology (GO) project is the largest resource for cataloguing gene function. The combination of solid conceptual underpinnings and a practical set of features have made the GO a widely adopted resource in the research community and an essential resource for data analysis. In this chapter, we provide a concise primer for all users of the GO. We briefly introduce the structure of the ontology and explain how to interpret annotations associated with the GO.

Keywords: Annotation evidence; Annotations; Evidence codes; Function; GO files; Gene Ontology structure; Gene association file (GAF); Vocabulary.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
The structure of the Gene Ontology (GO) is illustrated on a subset of the paths of the term “regulation of cell projection assembly,” GO:0060491, to its root term. The GO is a directed graph with terms as nodes and relationships as edges; these relationships are either is_a, part_of, has_part, or regulates. In its basic representation, there should be no cycles in this graph, and we can therefore establish parent (more general) and child (more specific) terms (see Chap. XX for more details on the different representations; cross-reference to Moni’s chapter). Note that it is possible for a term to have multiple parents. This figure is based on the visualization available from the AmiGO browser, generated on November 6, 2015. (3).
Figure 2.
Figure 2.. Gene Association File (GAF) 2.1 file format described with example elements.
In the GAF file, each row represents an annotation, consisting of up to 17 tab-delimited fields (or columns). This figure describes these fields in the order in which they are found in the GAF file. Light blue colour denotes non-mandatory fields, and these are allowed to be empty in the GAF file. The cardinality—the number of elements in the field — is denoted with the symbol(s) in curly brackets: {?} indicates cardinality of zero or one; {*} indicates that any cardinality is allowed; {+} indicates cardinality of one or more; {1} indicates that cardinality is exactly one; {1,2} indicates that cardinality is either one or two. When cardinality is greater than 1, elements in the field are separated with a pipe character or with a comma; the former indicates ‘OR’ and the latter indicates ‘AND’. The GO term assigned in column 5 is always the most specific GO term possible.
Figure 3:
Figure 3:. GO Evidence Codes and their abbreviations.
The type of information supporting annotations is recorded with Evidence Codes, which can be grouped into three main categories: experimental evidence codes, curated non-experimental annotations, and automatically assigned annotations. The obsolete evidence code NR (Not Recorded) is not included in the figure. Documentation about the different types of automatically assigned annotations can be found at http://www.geneontology.org/doc/GO.references.

References

    1. Ashburner M, Ball CA, Blake JA, et al. (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium, Nature genetics. 25, 25–29. - PMC - PubMed
    1. Friedmann HC (2004) From “butyribacterium” to “E. coli”: an essay on unity in biochemistry, Perspectives in biology and medicine. 47, 47–66. - PubMed
    1. Carbon S, Ireland A, Mungall CJ, et al. (2009) AmiGO: online access to ontology and annotation data, Bioinformatics. 25, 288–289. - PMC - PubMed
    1. Rhee SY, Wood V, Dolinski K, et al. (2008) Use and misuse of the gene ontology annotations, Nature reviews. Genetics 9, 509–515. - PubMed
    1. Arnaud MB, Costanzo MC, Shah P, et al. (2009) Gene Ontology and the annotation of pathogen genomes: the case of Candida albicans, Trends in microbiology. 17, 295–303. - PMC - PubMed

Publication types

LinkOut - more resources