Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Sep 22:2024.09.18.613276.
doi: 10.1101/2024.09.18.613276.

The Unified Phenotype Ontology (uPheno): A framework for cross-species integrative phenomics

Affiliations

The Unified Phenotype Ontology (uPheno): A framework for cross-species integrative phenomics

Nicolas Matentzoglu et al. bioRxiv. .

Update in

  • The Unified Phenotype Ontology : a framework for cross-species integrative phenomics.
    Matentzoglu N, Bello SM, Stefancsik R, Alghamdi SM, Anagnostopoulos AV, Balhoff JP, Balk MA, Bradford YM, Bridges Y, Callahan TJ, Caufield H, Cuzick A, Carmody LC, Caron AR, de Souza V, Engel SR, Fey P, Fisher M, Gehrke S, Grove C, Hansen P, Harris NL, Harris MA, Harris L, Ibrahim A, Jacobsen JOB, Köhler S, McMurry JA, Munoz-Fuentes V, Munoz-Torres MC, Parkinson H, Pendlington ZM, Pilgrim C, Robb SMC, Robinson PN, Seager J, Segerdell E, Smedley D, Sollis E, Toro S, Vasilevsky N, Wood V, Haendel MA, Mungall CJ, McLaughlin JA, Osumi-Sutherland D. Matentzoglu N, et al. Genetics. 2025 Mar 17;229(3):iyaf027. doi: 10.1093/genetics/iyaf027. Genetics. 2025. PMID: 40048704 Free PMC article.

Abstract

Phenotypic data are critical for understanding biological mechanisms and consequences of genomic variation, and are pivotal for clinical use cases such as disease diagnostics and treatment development. For over a century, vast quantities of phenotype data have been collected in many different contexts covering a variety of organisms. The emerging field of phenomics focuses on integrating and interpreting these data to inform biological hypotheses. A major impediment in phenomics is the wide range of distinct and disconnected approaches to recording the observable characteristics of an organism. Phenotype data are collected and curated using free text, single terms or combinations of terms, using multiple vocabularies, terminologies, or ontologies. Integrating these heterogeneous and often siloed data enables the application of biological knowledge both within and across species. Existing integration efforts are typically limited to mappings between pairs of terminologies; a generic knowledge representation that captures the full range of cross-species phenomics data is much needed. We have developed the Unified Phenotype Ontology (uPheno) framework, a community effort to provide an integration layer over domain-specific phenotype ontologies, as a single, unified, logical representation. uPheno comprises (1) a system for consistent computational definition of phenotype terms using ontology design patterns, maintained as a community library; (2) a hierarchical vocabulary of species-neutral phenotype terms under which their species-specific counterparts are grouped; and (3) mapping tables between species-specific ontologies. This harmonized representation supports use cases such as cross-species integration of genotype-phenotype associations from different organisms and cross-species informed variant prioritization.

PubMed Disclaimer

Conflict of interest statement

7.Conflict of Interest None declared.

Figures

Fig. 1.
Fig. 1.
Distribution of entity types in the uPheno pattern library. All phenotype definitions reference atleast one affected entity. The percentage of patterns using an entity type relative to all pattern templates are indicated. The main entity categories in uPheno phenotype pattern templates include: anatomical entity (UBERON:0001062), biological process (GO:0008150), cellular component (GO:0005575), chemical entity (CHEBI:24431), cell (CL:0000000), role (CHEBI:50906), behavior process (NBO:0000313), molecular function (GO:0003674), other entities (BFO:0000001).
Fig. 2:
Fig. 2:
A. uPheno is a framework for consistent and logical definition of phenotype categories using ontology design patterns that provides a hierarchical vocabulary of species-neutral phenotype terms under which their species-specific counterparts are grouped. The ontology design templates are based on shared features of existing phenotypic descriptions from various model organisms and represent community consensus. The phenotype-pattern template adherent terms are adopted by species-specific ontologies, thereby contributing to the community-built uPheno framework. B. uPheno accelerates cross-species inference and computationally amenable comparative phenotype analysis. For example, the interoperable representation of heart phenotypes characterized by increased size, compared with wild-type in distinct species, such as zebrafish and human, allows the cross-species identification of genes whose alleles can cause similar phenotypes. C. uPheno contextual hierarchy for increased size of the heart.
Figure 3:
Figure 3:
DOSDP pattern for the representation of abnormal anatomical entity phenotypes.Species-specific phenotype ontologies implement this pattern in phenotype terms such as “Abnormality of the cardiovascular system” (HP:0001626) and “gall bladder quality, abnormal” (ZP:0006529).
Figure 3:
Figure 3:
DOSDP pattern for the representation of abnormal anatomical entity phenotypes.Species-specific phenotype ontologies implement this pattern in phenotype terms such as “Abnormality of the cardiovascular system” (HP:0001626) and “gall bladder quality, abnormal” (ZP:0006529).

References

    1. Lima Cunha D., Arno G., Corton M. & Moosajee M. The Spectrum of PAX6 Mutations and Genotype-Phenotype Correlations in the Eye. Genes 10, (2019). - PMC - PubMed
    1. Fisher S. E. & Scharff C. FOXP2 as a molecular window into speech and language. Trends Genet. 25, 166–177 (2009). - PubMed
    1. Rodgers B. D. & Garikipati D. K. Clinical, agricultural, and evolutionary biology of myostatin: a comparative review. Endocr. Rev. 29, 513–534 (2008). - PMC - PubMed
    1. Gargano M. A. et al. The Human Phenotype Ontology in 2024: phenotypes around the world. Nucleic Acids Res. 52, D1333–D1346 (2024). - PMC - PubMed
    1. Smith C. L. & Eppig J. T. The mammalian phenotype ontology: enabling robust annotation and comparative analysis. Wiley Interdiscip. Rev. Syst. Biol. Med. 1, 390–399 (2009). - PMC - PubMed

Publication types

LinkOut - more resources