Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb 9;10(2):e0115692.
doi: 10.1371/journal.pone.0115692. eCollection 2015.

HPOSim: an R package for phenotypic similarity measure and enrichment analysis based on the human phenotype ontology

Affiliations

HPOSim: an R package for phenotypic similarity measure and enrichment analysis based on the human phenotype ontology

Yue Deng et al. PLoS One. .

Abstract

Background: Phenotypic features associated with genes and diseases play an important role in disease-related studies and most of the available methods focus solely on the Online Mendelian Inheritance in Man (OMIM) database without considering the controlled vocabulary. The Human Phenotype Ontology (HPO) provides a standardized and controlled vocabulary covering phenotypic abnormalities in human diseases, and becomes a comprehensive resource for computational analysis of human disease phenotypes. Most of the existing HPO-based software tools cannot be used offline and provide only few similarity measures. Therefore, there is a critical need for developing a comprehensive and offline software for phenotypic features similarity based on HPO.

Results: HPOSim is an R package for analyzing phenotypic similarity for genes and diseases based on HPO data. Seven commonly used semantic similarity measures are implemented in HPOSim. Enrichment analysis of gene sets and disease sets are also implemented, including hypergeometric enrichment analysis and network ontology analysis (NOA).

Conclusions: HPOSim can be used to predict disease genes and explore disease-related function of gene modules. HPOSim is open source and freely available at SourceForge (https://sourceforge.net/p/hposim/).

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Framework of HPOSim.
Users can use HPOSim to calculate semantic similarity for HPO terms, genes and diseases. HPOSim can also be used to identify enriched HPO terms for gene set and disease set.
Figure 2
Figure 2. Example of the structure of HPO.
HPO term Abnormality of the joints of the lower limbs (HP:0100491) and all its ancestor terms are shown. Each term in the HPO describes a phenotypic abnormality. Terms are related to parent terms by “is a” relationships in the form of a directed acyclic graph. If a disease or a gene is annotated to a term, it will also be annotated to all of its ancestors.

Similar articles

Cited by

  • Investigations on factors influencing HPO-based semantic similarity calculation.
    Peng J, Li Q, Shang X. Peng J, et al. J Biomed Semantics. 2017 Sep 20;8(Suppl 1):34. doi: 10.1186/s13326-017-0144-y. J Biomed Semantics. 2017. PMID: 29297376 Free PMC article.
  • Pheno-Ranker: a toolkit for comparison of phenotypic data stored in GA4GH standards and beyond.
    Leist IC, Rivas-Torrubia M, Alarcón-Riquelme ME, Barturen G, Consortium PC, Gut IG, Rueda M. Leist IC, et al. BMC Bioinformatics. 2024 Dec 4;25(1):373. doi: 10.1186/s12859-024-05993-2. BMC Bioinformatics. 2024. PMID: 39633268 Free PMC article.
  • PubCaseFinder: A Case-Report-Based, Phenotype-Driven Differential-Diagnosis System for Rare Diseases.
    Fujiwara T, Yamamoto Y, Kim JD, Buske O, Takagi T. Fujiwara T, et al. Am J Hum Genet. 2018 Sep 6;103(3):389-399. doi: 10.1016/j.ajhg.2018.08.003. Epub 2018 Aug 30. Am J Hum Genet. 2018. PMID: 30173820 Free PMC article.
  • PhenoExam: gene set analyses through integration of different phenotype databases.
    Cisterna A, González-Vidal A, Ruiz D, Ortiz J, Gómez-Pascual A, Chen Z, Nalls M, Faghri F, Hardy J, Díez I, Maietta P, Álvarez S, Ryten M, Botía JA. Cisterna A, et al. BMC Bioinformatics. 2022 Dec 31;23(1):567. doi: 10.1186/s12859-022-05122-x. BMC Bioinformatics. 2022. PMID: 36587217 Free PMC article.
  • The genomic landscape of balanced cytogenetic abnormalities associated with human congenital anomalies.
    Redin C, Brand H, Collins RL, Kammin T, Mitchell E, Hodge JC, Hanscom C, Pillalamarri V, Seabra CM, Abbott MA, Abdul-Rahman OA, Aberg E, Adley R, Alcaraz-Estrada SL, Alkuraya FS, An Y, Anderson MA, Antolik C, Anyane-Yeboa K, Atkin JF, Bartell T, Bernstein JA, Beyer E, Blumenthal I, Bongers EM, Brilstra EH, Brown CW, Brüggenwirth HT, Callewaert B, Chiang C, Corning K, Cox H, Cuppen E, Currall BB, Cushing T, David D, Deardorff MA, Dheedene A, D'Hooghe M, de Vries BB, Earl DL, Ferguson HL, Fisher H, FitzPatrick DR, Gerrol P, Giachino D, Glessner JT, Gliem T, Grady M, Graham BH, Griffis C, Gripp KW, Gropman AL, Hanson-Kahn A, Harris DJ, Hayden MA, Hill R, Hochstenbach R, Hoffman JD, Hopkin RJ, Hubshman MW, Innes AM, Irons M, Irving M, Jacobsen JC, Janssens S, Jewett T, Johnson JP, Jongmans MC, Kahler SG, Koolen DA, Korzelius J, Kroisel PM, Lacassie Y, Lawless W, Lemyre E, Leppig K, Levin AV, Li H, Li H, Liao EC, Lim C, Lose EJ, Lucente D, Macera MJ, Manavalan P, Mandrile G, Marcelis CL, Margolin L, Mason T, Masser-Frye D, McClellan MW, Mendoza CJ, Menten B, Middelkamp S, Mikami LR, Moe E, Mohammed S, Mononen T, Mortenson ME, Moya G, Nieuwint AW, Ordulu Z, Parkash S, Pauker SP, Pereira… See abstract for full author list ➔ Redin C, et al. Nat Genet. 2017 Jan;49(1):36-45. doi: 10.1038/ng.3720. Epub 2016 Nov 14. Nat Genet. 2017. PMID: 27841880 Free PMC article.

References

    1. Köhler S, Bauer S, Horn D, Robinson PN (2008) Walking the Interactome for Prioritization of Candidate Disease Genes. Am J Hum Genet 82: 949–958. 10.1016/j.ajhg.2008.02.013 - DOI - PMC - PubMed
    1. Van Driel MA, Bruggeman J, Vriend G, Brunner HG, Leunissen JAM (2006) A text-mining analysis of the human phenome. Eur J Hum Genet 14: 535–542. 10.1038/sj.ejhg.5201585 - DOI - PubMed
    1. Oti M, Brunner H (2007) The modular nature of genetic diseases. Clinical Genetics 71: 1–11. 10.1111/j.1399-0004.2006.00708.x - DOI - PubMed
    1. Lage K, Karlberg EO, Størling ZM, Olason PI, Pedersen AG, et al. (2007) A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat Biotechnol 25: 309–316. 10.1038/nbt1295 - DOI - PubMed
    1. Freudenberg J, Propping P (2002) A similarity-based method for genome-wide prediction of disease-relevant human genes. Bioinformatics 18: S110–S115. 10.1093/bioinformatics/18.suppl_2.S110 - DOI - PubMed

Publication types