The biological coherence of human phenome databases
- PMID: 20004759
- PMCID: PMC2790572
- DOI: 10.1016/j.ajhg.2009.10.026
The biological coherence of human phenome databases
Abstract
Disease networks are increasingly explored as a complement to networks centered around interactions between genes and proteins. The quality of disease networks is heavily dependent on the amount and quality of phenotype information in phenotype databases of human genetic diseases. We explored which aspects of phenotype database architecture and content best reflect the underlying biology of disease. We used the OMIM-based HPO, Orphanet, and POSSUM phenotype databases for this purpose and devised a biological coherence score based on the sharing of gene ontology annotation to investigate the degree to which phenotype similarity in these databases reflects related pathobiology. Our analyses support the notion that a fine-grained phenotype ontology enhances the accuracy of phenome representation. In addition, we find that the OMIM database that is most used by the human genetics community is heavily underannotated. We show that this problem can easily be overcome by simply adding data available in the POSSUM database to improve OMIM phenotype representations in the HPO. Also, we find that the use of feature frequency estimates--currently implemented only in the Orphanet database--significantly improves the quality of the phenome representation. Our data suggest that there is much to be gained by improving human phenome databases and that some of the measures needed to achieve this are relatively easy to implement. More generally, we propose that curation and more systematic annotation of human phenome databases can greatly improve the power of the phenotype for genetic disease analysis.
Figures


Similar articles
-
A text-mining analysis of the human phenome.Eur J Hum Genet. 2006 May;14(5):535-42. doi: 10.1038/sj.ejhg.5201585. Eur J Hum Genet. 2006. PMID: 16493445
-
HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.J Biomed Inform. 2019 Aug;96:103246. doi: 10.1016/j.jbi.2019.103246. Epub 2019 Jun 27. J Biomed Inform. 2019. PMID: 31255713 Free PMC article.
-
PhenoDis: a comprehensive database for phenotypic characterization of rare cardiac diseases.Orphanet J Rare Dis. 2018 Jan 25;13(1):22. doi: 10.1186/s13023-018-0765-y. Orphanet J Rare Dis. 2018. PMID: 29370821 Free PMC article.
-
The human phenotype ontology.Clin Genet. 2010 Jun;77(6):525-34. doi: 10.1111/j.1399-0004.2010.01436.x. Epub 2010 Feb 11. Clin Genet. 2010. PMID: 20412080 Review.
-
Searching Online Mendelian Inheritance in Man (OMIM): A Knowledgebase of Human Genes and Genetic Phenotypes.Curr Protoc Bioinformatics. 2017 Jun 27;58:1.2.1-1.2.12. doi: 10.1002/cpbi.27. Curr Protoc Bioinformatics. 2017. PMID: 28654725 Free PMC article. Review.
Cited by
-
Computational tools for prioritizing candidate genes: boosting disease gene discovery.Nat Rev Genet. 2012 Jul 3;13(8):523-36. doi: 10.1038/nrg3253. Nat Rev Genet. 2012. PMID: 22751426 Review.
-
Using association rule mining to determine promising secondary phenotyping hypotheses.Bioinformatics. 2014 Jun 15;30(12):i52-59. doi: 10.1093/bioinformatics/btu260. Bioinformatics. 2014. PMID: 24932005 Free PMC article.
-
Computational tools for comparative phenomics: the role and promise of ontologies.Mamm Genome. 2012 Oct;23(9-10):669-79. doi: 10.1007/s00335-012-9404-4. Epub 2012 Jul 20. Mamm Genome. 2012. PMID: 22814867 Free PMC article. Review.
-
Multiple sclerosis genetics--is the glass half full, or half empty?Nat Rev Neurol. 2010 Aug;6(8):429-37. doi: 10.1038/nrneurol.2010.91. Epub 2010 Jul 13. Nat Rev Neurol. 2010. PMID: 20625377 Review.
-
Comparative analysis of a novel disease phenotype network based on clinical manifestations.J Biomed Inform. 2015 Feb;53:113-20. doi: 10.1016/j.jbi.2014.09.007. Epub 2014 Sep 30. J Biomed Inform. 2015. PMID: 25277758 Free PMC article.
References
-
- Oti M., Brunner H.G. The modular nature of genetic diseases. Clin. Genet. 2007;71:1–11. - PubMed
-
- Oti M., Huynen M.A., Brunner H.G. Phenome connections. Trends Genet. 2008;24:103–106. - PubMed
-
- Brunner H.G., van Driel M.A. From syndrome families to functional genomics. Nat. Rev. Genet. 2004;5:545–551. - PubMed
-
- Freudenberg J., Propping P. A similarity-based method for genome-wide prediction of disease-relevant human genes. Bioinformatics. 2002;18(Suppl 2):S110–S115. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources