The Unified Phenotype Ontology : a framework for cross-species integrative phenomics
- PMID: 40048704
- PMCID: PMC11912833
- DOI: 10.1093/genetics/iyaf027
The Unified Phenotype Ontology : a framework for cross-species integrative phenomics
Abstract
Phenotypic data are critical for understanding biological mechanisms and consequences of genomic variation, and are pivotal for clinical use cases such as disease diagnostics and treatment development. For over a century, vast quantities of phenotype data have been collected in many different contexts covering a variety of organisms. The emerging field of phenomics focuses on integrating and interpreting these data to inform biological hypotheses. A major impediment in phenomics is the wide range of distinct and disconnected approaches to recording the observable characteristics of an organism. Phenotype data are collected and curated using free text, single terms or combinations of terms, using multiple vocabularies, terminologies, or ontologies. Integrating these heterogeneous and often siloed data enables the application of biological knowledge both within and across species. Existing integration efforts are typically limited to mappings between pairs of terminologies; a generic knowledge representation that captures the full range of cross-species phenomics data is much needed. We have developed the Unified Phenotype Ontology (uPheno) framework, a community effort to provide an integration layer over domain-specific phenotype ontologies, as a single, unified, logical representation. uPheno comprises (1) a system for consistent computational definition of phenotype terms using ontology design patterns, maintained as a community library; (2) a hierarchical vocabulary of species-neutral phenotype terms under which their species-specific counterparts are grouped; and (3) mapping tables between species-specific ontologies. This harmonized representation supports use cases such as cross-species integration of genotype-phenotype associations from different organisms and cross-species informed variant prioritization.
Keywords: integration; ontology; phenotype; semantics.
© The Author(s) 2025. Published by Oxford University Press on behalf of The Genetics Society of America.
Conflict of interest statement
Conflicts of interest: The author(s) declare no conflicts of interest.
Figures
Update of
-
The Unified Phenotype Ontology (uPheno): A framework for cross-species integrative phenomics.bioRxiv [Preprint]. 2024 Sep 22:2024.09.18.613276. doi: 10.1101/2024.09.18.613276. bioRxiv. 2024. Update in: Genetics. 2025 Mar 17;229(3):iyaf027. doi: 10.1093/genetics/iyaf027. PMID: 39345458 Free PMC article. Updated. Preprint.
References
MeSH terms
Grants and funding
- U41HG002273/Gene Ontology Consortium
- U41 HG001315/HG/NHGRI NIH HHS/United States
- Sanofi
- 7R24 OD011883/EMBL-EBI Core Funds
- Open Targets
- U24 HG010859/HG/NHGRI NIH HHS/United States
- Celgene
- U24 HG011449/HG/NHGRI NIH HHS/United States
- WT_/Wellcome Trust/United Kingdom
- Office of Science
- RM1 HG010860/HG/NHGRI NIH HHS/United States
- 1U24HG012542-01/HG/NHGRI NIH HHS/United States
- NIH-NHGRI # 5RM1 HG010860/NIH National Human Genome Research Institute Phenomics First Resource
- Biogen
- U24HG010859)/Alliance of Genome Resources
- Takeda
- U24 HG012542/HG/NHGRI NIH HHS/United States
- U41 HG002273/HG/NHGRI NIH HHS/United States
- P41 HD064556/HD/NICHD NIH HHS/United States
- OD R24 OD011883/GF/NIH HHS/United States
- P41 HG000330/HG/NHGRI NIH HHS/United States
- CD/ODCDC CDC HHS/United States
- BB/X010953/1/BBSRC Growing Health
- #5R24 OD011883/NH/NIH HHS/United States
- R24 OD011883/OD/NIH HHS/United States
- 218236/Z/19/Z/Wellcome Grant
- BB/X011003/1/Delivering Sustainable Wheat
- U41 HG002659/HG/NHGRI NIH HHS/United States
- DE-AC0205CH11231/US Department of Energy
- Dicty database and Stock Center
- UM1 HG006370/HG/NHGRI NIH HHS/United States
- Center of Excellence in Genomic Science
- EMBL-EBI
- GSK
- Office of Basic Energy Sciences
