Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 11:14:1185790.
doi: 10.3389/fgene.2023.1185790. eCollection 2023.

PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary

Affiliations

PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary

Xinhua Liu et al. Front Genet. .

Abstract

Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom counted the number of overlapping MeSH terms between two phenotypes and then took the weight of every MeSH term within each phenotype into account according to the term frequency-inverse document frequency (FIDC). Phenotype-related genes were used for the evaluation of our method. Results: A 7,739 × 7,739 similarity score matrix was finally obtained and the number of phenotype pairs was dramatically decreased with the increase of similarity score. Besides, the overlapping rates of phenotype-related genes were remarkably increased with the increase of similarity score between phenotypes, which supports the reliability of our method. Conclusion: We anticipate our method can be applied to identifying novel therapeutic methods for complex diseases.

Keywords: FIDC; OMIM; mesh; phenotype; similarity score.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Flow chart of this study.
FIGURE 2
FIGURE 2
Number of phenotype pairs within different similarity score bins in VOR (A) and VW (B) methods.
FIGURE 3
FIGURE 3
The overlapping rates of phenotypes-related genes within different similarity score bins in VOR (A) and VM (B) methods. Blue histograms and red lines represented VOR/VW methods and random gene assignment processes, respectively.
FIGURE 4
FIGURE 4
Phenotype comparison results of breast cancer. (A) Distribution of similarity scores calculated by VW, VOR, Phenosim, Renisk, et al., and Jiang and Conrath (1997), between breast cancer and the 658 phenotypes that had similarity scores greater than 0 obtained through the VW method. (B) Venn diagram depicting overlaps of phenotypes that had similarity scores with breast cancer greater than 0 in all the five methods. (C) Venn diagram depicting overlaps of phenotypes that had similarity scores with breast cancer greater than 0.2 in all the five methods.

Similar articles

References

    1. Alving C. R., Matyas G. R., Torres O., Jalah R., Beck Z. (2014). Adjuvants for vaccines to drugs of abuse and addiction. Vaccine 32 (42), 5382–5389. 10.1016/j.vaccine.2014.07.085 - DOI - PMC - PubMed
    1. Amberger J., Bocchini C., Hamosh A. (2011). A new face and new challenges for Online Mendelian Inheritance in Man (OMIM®). Hum. Mutat. 32 (5), 564–567. 10.1002/humu.21466 - DOI - PubMed
    1. Botstein D., Risch N. (2003). Discovering genotypes underlying human phenotypes: Past successes for mendelian disease, future approaches for complex disease. Nat. Genet. 33, 228–237. 10.1038/ng1090 - DOI - PubMed
    1. Bruse S., Moreau M., Bromberg Y., Jang J. H., Wang N., Ha H., et al. (2016). Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility. Hum. Genomics 10 (1), 1. 10.1186/s40246-015-0058-7 - DOI - PMC - PubMed
    1. Butler M. G., Rafi S. K., Hossain W., Stephan D. A., Manzardo A. M. (2015). Whole exome sequencing in females with autism implicates novel and candidate genes. Int. J. Mol. Sci. 16 (1), 1312–1335. 10.3390/ijms16011312 - DOI - PMC - PubMed

LinkOut - more resources