Linking human diseases to animal models using ontology-based phenotype annotation
- PMID: 19956802
- PMCID: PMC2774506
- DOI: 10.1371/journal.pbio.1000247
Linking human diseases to animal models using ontology-based phenotype annotation
Abstract
Scientists and clinicians who study genetic alterations and disease have traditionally described phenotypes in natural language. The considerable variation in these free-text descriptions has posed a hindrance to the important task of identifying candidate genes and models for human diseases and indicates the need for a computationally tractable method to mine data resources for mutant phenotypes. In this study, we tested the hypothesis that ontological annotation of disease phenotypes will facilitate the discovery of new genotype-phenotype relationships within and across species. To describe phenotypes using ontologies, we used an Entity-Quality (EQ) methodology, wherein the affected entity (E) and how it is affected (Q) are recorded using terms from a variety of ontologies. Using this EQ method, we annotated the phenotypes of 11 gene-linked human diseases described in Online Mendelian Inheritance in Man (OMIM). These human annotations were loaded into our Ontology-Based Database (OBD) along with other ontology-based phenotype descriptions of mutants from various model organism databases. Phenotypes recorded with this EQ method can be computationally compared based on the hierarchy of terms in the ontologies and the frequency of annotation. We utilized four similarity metrics to compare phenotypes and developed an ontology of homologous and analogous anatomical structures to compare phenotypes between species. Using these tools, we demonstrate that we can identify, through the similarity of the recorded phenotypes, other alleles of the same gene, other members of a signaling pathway, and orthologous genes and pathway members across species. We conclude that EQ-based annotation of phenotypes, in conjunction with a cross-species ontology, and a variety of similarity metrics can identify biologically meaningful similarities between genes by comparing phenotypes alone. This annotation and search method provides a novel and efficient means to identify gene candidates and animal models of human disease, which may shorten the lengthy path to identification and understanding of the genetic basis of human disease.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures







Similar articles
-
Evolutionary characters, phenotypes and ontologies: curating data from the systematic biology literature.PLoS One. 2010 May 20;5(5):e10708. doi: 10.1371/journal.pone.0010708. PLoS One. 2010. PMID: 20505755 Free PMC article.
-
Practical application of ontologies to annotate and analyse large scale raw mouse phenotype data.BMC Bioinformatics. 2009 May 6;10 Suppl 5(Suppl 5):S2. doi: 10.1186/1471-2105-10-S5-S2. BMC Bioinformatics. 2009. PMID: 19426459 Free PMC article.
-
Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems.Database (Oxford). 2018 Jan 1;2018:bay110. doi: 10.1093/database/bay110. Database (Oxford). 2018. PMID: 30576485 Free PMC article.
-
The mammalian phenotype ontology: enabling robust annotation and comparative analysis.Wiley Interdiscip Rev Syst Biol Med. 2009 Nov-Dec;1(3):390-399. doi: 10.1002/wsbm.44. Wiley Interdiscip Rev Syst Biol Med. 2009. PMID: 20052305 Free PMC article. Review.
-
[Review on the research progress of mining of OMIM data].Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2014 Dec;31(6):1400-4. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2014. PMID: 25868267 Review. Chinese.
Cited by
-
Mining skeletal phenotype descriptions from scientific literature.PLoS One. 2013;8(2):e55656. doi: 10.1371/journal.pone.0055656. Epub 2013 Feb 8. PLoS One. 2013. PMID: 23409017 Free PMC article.
-
Capturing domain knowledge from multiple sources: the rare bone disorders use case.J Biomed Semantics. 2015 Apr 17;6:21. doi: 10.1186/s13326-015-0008-2. eCollection 2015. J Biomed Semantics. 2015. PMID: 25926964 Free PMC article.
-
Phenotypic overlap in the contribution of individual genes to CNV pathogenicity revealed by cross-species computational analysis of single-gene mutations in humans, mice and zebrafish.Dis Model Mech. 2013 Mar;6(2):358-72. doi: 10.1242/dmm.010322. Epub 2012 Oct 25. Dis Model Mech. 2013. PMID: 23104991 Free PMC article.
-
Logical Gene Ontology Annotations (GOAL): exploring gene ontology annotations with OWL.J Biomed Semantics. 2012 Apr 24;3 Suppl 1(Suppl 1):S3. doi: 10.1186/2041-1480-3-S1-S3. J Biomed Semantics. 2012. PMID: 22541594 Free PMC article.
-
Linking gene expression to phenotypes via pathway information.J Biomed Semantics. 2015 Apr 11;6:17. doi: 10.1186/s13326-015-0013-5. eCollection 2015. J Biomed Semantics. 2015. PMID: 25901272 Free PMC article.
References
-
- Collin G. B, Marshall J. D, Ikeda A, So W. V, Russell-Eggitt I, et al. Mutations in ALMS1 cause obesity, type 2 diabetes and neurosensory degeneration in Alstrom syndrome. Nat Genet. 2002;31:74–78. - PubMed
-
- Arsov T, Silva D. G, O'Bryan M. K, Sainsbury A, Lee N. J, et al. Fat aussie–a new Alstrom syndrome mouse showing a critical role for ALMS1 in obesity, diabetes, and spermatogenesis. Mol Endocrinol. 2006;20:1610–1622. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases