Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Apr;13(4):662-72.
doi: 10.1101/gr.461403. Epub 2003 Mar 12.

The Gene Ontology Annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro

Affiliations

The Gene Ontology Annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro

Evelyn Camon et al. Genome Res. 2003 Apr.

Abstract

Gene Ontology Annotation (GOA) is a project run by the European Bioinformatics Institute (EBI) that aims to provide assignments of terms from the Gene Ontology (GO) resource to gene products in a number of its databases (http://www.ebi.ac.uk/GOA). In the first stage of this project, GO assignments have been applied to a data set representing the complete human proteome by a combination of electronic mappings and manual curation. This vocabulary has also been applied to the nonredundant proteome sets for all other completely sequenced organisms as well as to proteins from a wide range of organisms where the proteome is not yet complete.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
InterPro entry IPR000040 (acute myeloid leukemia 1 protein [AML 1]/Runt) that shows the GO terms that have been manually mapped to the entry.
Figure 2.
Figure 2.
(A) Sample of QuickGO display page showing all GO terms that have been mapped electronically and manually to SWISS-PROT entry Q9Z247. (B) Sample of QuickGO display screen showing all information currently available through EBI for the GO term “protein phosphatase,” using the denormalized tree view. (C) Sample of QuickGO display screen showing all information currently available through EBI for the GO term “protein phosphatase,” using the graphical tree view.
Figure 2.
Figure 2.
(A) Sample of QuickGO display page showing all GO terms that have been mapped electronically and manually to SWISS-PROT entry Q9Z247. (B) Sample of QuickGO display screen showing all information currently available through EBI for the GO term “protein phosphatase,” using the denormalized tree view. (C) Sample of QuickGO display screen showing all information currently available through EBI for the GO term “protein phosphatase,” using the graphical tree view.
Figure 2.
Figure 2.
(A) Sample of QuickGO display page showing all GO terms that have been mapped electronically and manually to SWISS-PROT entry Q9Z247. (B) Sample of QuickGO display screen showing all information currently available through EBI for the GO term “protein phosphatase,” using the denormalized tree view. (C) Sample of QuickGO display screen showing all information currently available through EBI for the GO term “protein phosphatase,” using the graphical tree view.
Figure 3.
Figure 3.
Searching Gene Ontology Annotation (GOA) database with Sequence Retrieval System (SRS). (A) To find all annotated proteins that function as electron transfer flavoproteins and that have an experimental evidence code (Non-IEA), the “goid” field is searched for the GO identifier “0008246”in the GOA database. In the “combined searches with” section of the tool bar, the “BUTNOT” option is selected and ”IEA” (the GO evidence code for “inferred from electronic annotation”) is entered in the “evidence” field. (B) This produces a query result, which displays all proteins manually assigned the function of “electron transport flavoprotein” using published literature. Associations made by electronic inference are filtered out and results displayed in the “gene association file” format. (C, D) A further facility of SRS is its ability to link to databases that may or may not contain direct references to each other. As such, the last search can be extended to display EMBL/GenBank/DDBJ accession numbers by selecting the “link” option and choosing the EMBL database and “submit link.”
Figure 4.
Figure 4.
Example of a table from the proteome analysis database showing the general statistics for the number of proteins in the human proteome that can be assigned to a selection of high-level terms (GO Slim) from each of the three gene ontologies.
Figure 5.
Figure 5.
Percentage of proteins associated with GO terms from each ontology, using the interpro2go and spkw2go mappings (interpro2go = mapping of InterPro entries to GO terms; spkw2go = mapping of SWISS-PROT keywords to GO).

References

    1. Apweiler R. 2001. Functional information in SWISS-PROT: The basis for large-scale characterisation of protein sequences. Brief. Bioinform. 2: 9-18. - PubMed
    1. Apweiler R., Attwood, T.K., Bairoch, A., Bateman, A., Birney, E., Biswas, M., Bucher, P., Cerutti, L., Corpet, F., Croning, M.D., et al. 2001a. The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 29: 37-40. - PMC - PubMed
    1. Apweiler R., Biswas, M., Fleischmann, W., Kanapin, A., Karavidopoulou, Y., Kersey, P., Kriventseva, E.V., Mittard, V., Mulder, N., Phan, I., et al. 2001b. Proteome analysis database: Online application of InterPro and CluSTr for the functional classification of proteins in whole genomes. Nucleic Acids Res. 29: 44-48. - PMC - PubMed
    1. Attwood T.K., Blythe, M.J., Flower, D.R., Gaulton, A., Mabey, J.E., Maudling, N., McGregor, L., Mitchell, A.L., Moulton, G., Paine, K., et al. 2002. PRINTS and PRINTS-S shed light on protein ancestry. Nucleic Acids Res. 30: 239-241. - PMC - PubMed
    1. Bairoch A. 2000. The ENZYME database in 2000. Nucleic Acids Res. 28: 304-305. - PMC - PubMed

Publication types

LinkOut - more resources