Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 May 4;224(1):iyad031.
doi: 10.1093/genetics/iyad031.

The Gene Ontology knowledgebase in 2023

Gene Ontology ConsortiumSuzi A AleksanderJames BalhoffSeth CarbonJ Michael CherryHarold J DrabkinDustin EbertMarc FeuermannPascale GaudetNomi L HarrisDavid P HillRaymond LeeHuaiyu MiSierra MoxonChristopher J MungallAnushya MuruganuganTremayne MushayahamaPaul W SternbergPaul D ThomasKimberly Van AukenJolene RamseyDeborah A SiegeleRex L ChisholmPetra FeyMaria Cristina AspromonteMaria Victoria NugnesFederica QuagliaSilvio TosattoMichelle GiglioSuvarna NadendlaGiulia AntonazzoHelen AttrillGil Dos SantosSteven MarygoldVictor StreletsChristopher J TaboneJim ThurmondPinglei ZhouSaadullah H AhmedPraoparn AsanitthongDiana Luna BuitragoMeltem N ErdolMatthew C GageMohamed Ali KadhumKan Yan Chloe LiMiao LongAleksandra MichalakAngeline PesalaArmalya PritazahraShirin C C SaverimuttuRenzhi SuKate E ThurlowRuth C LoveringColin LogieSnezhana OliferenkoJudith BlakeKaren ChristieLori CorbaniMary E DolanHarold J DrabkinDavid P HillLi NiDmitry SitnikovCynthia SmithAlayne CuzickJames SeagerLaurel CooperJustin ElserPankaj JaiswalParul GuptaPankaj JaiswalSushma NaithaniManuel Lera-RamirezKim RutherfordValerie WoodJeffrey L De PonsMelinda R DwinellG Thomas HaymanMary L KaldunskiAnne E KwitekStanley J F LaulederkindMarek A TutajMahima VediShur-Jen WangPeter D'EustachioLucila AimoKristian AxelsenAlan BridgeNevila Hyka-NouspikelAnne MorgatSuzi A AleksanderJ Michael CherryStacia R EngelKalpana KarraStuart R MiyasatoRobert S NashMarek S SkrzypekShuai WengEdith D WongErika BakkerTanya Z BerardiniLeonore ReiserAndrea AuchinclossKristian AxelsenGhislaine Argoud-PuyMarie-Claude BlatterEmmanuel BoutetLionel BreuzaAlan BridgeCristina Casals-CasasElisabeth CoudertAnne EstreicherMaria Livia FamigliettiMarc FeuermannArnaud GosNadine Gruaz-GumowskiChantal HuloNevila Hyka-NouspikelFlorence JungoPhilippe Le MercierDamien LieberherrPatrick MassonAnne MorgatIvo PedruzziLucille PourcelSylvain PouxCatherine RivoireShyamala SundaramAlex BatemanEmily Bowler-BarnettHema Bye-A-JeePaul DennyAlexandr IgnatchenkoRizwan IshtiaqAntonia LockYvonne LussiMichele MagraneMaria J MartinSandra OrchardPedro RaposoElena SperettaNidhi TyagiKate WarnerRossana ZaruAlexander D DiehlRaymond LeeJuancarlos ChanStavros DiamantakisDaniela RacitiMagdalena ZarowieckiMalcolm FisherChristina James-ZornVirgilio PonferradaAaron ZornSridhar RamachandranLeyla RuzickaMonte Westerfield
Collaborators
Review

The Gene Ontology knowledgebase in 2023

Gene Ontology Consortium et al. Genetics. .

Abstract

The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project.

Keywords: Gene Ontology; gene annotation; gene function; knowledge graphs; knowledgebase.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest The authors declare no conflicts of interest.

Figures

Fig. 1.
Fig. 1.
Examples of the three components of the GO knowledgebase. a) The GO ontology consists of terms, e.g. DNA binding transcription factor activity, and relationships between the terms (arrows; black = is a, blue = part of, and orange = regulates). b) GO annotations associate a specific gene product (here, human ZNF410) with GO terms asserting its functional aspects (“GO Class” column, e.g. sequence-specific double-stranded DNA binding) and the evidence for each assertion with its traceable source (“Evidence” and “Reference” columns). c) The GO-CAM model combines individual GO annotations into a model, in this case a very simple model describing how human ZNF410 acts as a transcription factor to positively regulate (denoted by the arrow) transcription of the CHD4 gene, which in turn acts as a corepressor to repress (denoted by dashed lines) transcription of fetal hemoglobin genes (HBG1 and HBG2) in erythroid lineage cells. In this view, each box in the GO-CAM is labeled with the gene product and species abbreviation for simplicity.
Fig. 2.
Fig. 2.
GO-CAM model of the SARS-CoV2—host interactions as displayed using the GO-CAM Pathway Widget (code available at https://github.com/geneontology/wc-gocam-viz) on the Alliance of Genome Resources gene pages (https://www.alliancegenome.org/gene/HGNC:20144#pathways). The model includes proteins from both humans (Hsap) and the SARS-CoV-2 virus (Scov2). A simplified representation of the causal model is shown on the main figure, which is simplified by labeling each activity with the gene and organism. The model includes many additional details, which are displayed as “cards;” the information for MAVS activity (inset) which normally acts as a signaling adaptor located in the mitochondrial membrane. MAVS activity is suppressed directly by the SARS-CoV-2M protein and indirectly by other SARS-CoV-2 proteins. Each of the “E” symbols on the right-hand side can be clicked to see the evidence for each assertion in the model.
Fig. 3.
Fig. 3.
Alliance ribbon view for the yeast RPB7 gene. High-level GO categories annotated are shown in shaded squares (https://www.alliancegenome.org/gene/SGD:S000002812); darker shading indicates more annotations in that category.

Similar articles

Cited by

References

    1. Alliance of Genome Resources Consortium . Harmonizing model organism data in the Alliance of Genome Resources. Genetics. 2022;220:iyac022. doi:10.1093/genetics/iyac022. - DOI - PMC - PubMed
    1. Altman T, Travers M, Kothari A, Caspi R, Karp PD. A systematic comparison of the MetaCyc and KEGG pathway databases. BMC Bioinformatics. 2013;14(1):112. doi:10.1186/1471-2105-14-112. - DOI - PMC - PubMed
    1. Ambrus JL Jr, Pippin J, Joseph A, Xu C, Blumenthal D, Tamayo A, Claypool K, McCourt D, Srikiatchatochorn A, Ford RJ. Identification of a cDNA for a human high-molecular-weight B-cell growth factor. Proc Natl Acad Sci U S A. 1993;90(13):6330–6334. doi:10.1073/pnas.90.13.6330. - DOI - PMC - PubMed
    1. Ambrus JL Jr, Pippin J, Joseph A, Xu C, Blumenthal D, Tamayo A, Claypool K, McCourt D, Srikiatchatochorn A, Ford R. Identification of a cDNA for a human high molecular-weight B-cell growth factor. Proc Natl Acad Sci U S A. 1996;93(15):8154. doi:10.1073/pnas.93.15.8154-b. - DOI - PMC - PubMed
    1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25(1):25–29. doi:10.1038/75556. - DOI - PMC - PubMed

Publication types