Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 13:35:23.
doi: 10.1186/s42826-019-0023-z. eCollection 2019.

TarGo: network based target gene selection system for human disease related mouse models

Affiliations

TarGo: network based target gene selection system for human disease related mouse models

Daejin Hyung et al. Lab Anim Res. .

Abstract

Genetically engineered mouse models are used in high-throughput phenotyping screens to understand genotype-phenotype associations and their relevance to human diseases. However, not all mutant mouse lines with detectable phenotypes are associated with human diseases. Here, we propose the "Target gene selection system for Genetically engineered mouse models" (TarGo). Using a combination of human disease descriptions, network topology, and genotype-phenotype correlations, novel genes that are potentially related to human diseases are suggested. We constructed a gene interaction network using protein-protein interactions, molecular pathways, and co-expression data. Several repositories for human disease signatures were used to obtain information on human disease-related genes. We calculated disease- or phenotype-specific gene ranks using network topology and disease signatures. In conclusion, TarGo provides many novel features for gene function prediction.

Keywords: Bioinformatics; Database; Genetic engineered mice; PageRank algorithm; Systems biology.

PubMed Disclaimer

Conflict of interest statement

Competing interestsNone declared

Figures

Fig. 1
Fig. 1
Prediction of gene-phenotype association using interaction network and signature genes. The hyperlink matrix and dangling matrix (no outgoing edge matrix) were constructed from interaction databases (red box). The weight matrix was constructed using public annotation databases (orange box). The d represented the dumping factor (0.85). The TSPR score represented the gene association for that particular phenotype. For this given TSPR score vector, A and D were signature genes. The seed nodes were selected from the top three ranking TSPR scores. A and D were good seeds because these two genes were signature genes in the input phenotype and ranked among the top three in the TSPR result. Therefore, d is 2 in this figure, meaning the good seed vector is ½. All other cases were given 0. Finally, phenotype-associated genes were selected from those with a high TrustRank score and low Spammass score
Fig. 2
Fig. 2
ROC curve for TarGo prediction results. a In the MP term, the AUC score of TSPR is 0.66, TrustRank is 0.82, and SM filter is 0.91. b In the MeSH term, the AUC score of TSPR is 0.63, TrustRank is 0.95, and SM filter is 0.93. c Overlapping between IMPC and TarGo. the X-axis indicates high-ranking genes sorted by rank score. The Y-axis is the overlapping percentage between IMPC and TarGo
Fig. 3
Fig. 3
Contents of the TarGo database. a The user can find the knock out (KO) target gene using the TarGo search system. Known phenotype, KO mouse state, and high-related phenotype are provided using gene name (blue line). High-associated genes are provided using multiple phenotype selection (red line). b The user can analyze gene rank using in-house data. The researcher can also use various in-house data, including gene sets from the Omics approach or empirical knowledge. Therefore, the user can regenerate gene rank with user-defined gene sets or network (green line)

Similar articles

References

    1. Koscielny G, Yaikhom G, Iyer V, Meehan TF, Morgan H, Atienza-Herrero J, Blake A, Chen CK, Easty R, Di Fenza A, Fiegel T, Grifiths M, Horne A, Karp NA, Kurbatova N, Mason JC, Matthews P, Oakley DJ, Qazi A, Regnart J, Retha A, Santos LA, Sneddon DJ, Warren J, Westerberg H, Wilson RJ, Melvin DG, Smedley D, Brown SD, Flicek P, Skarnes WC, Mallon AM, Parkinson H. The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data. Nucleic Acids Res 2013. - PMC - PubMed
    1. Ala U, Piro RM, Grassi E, Damasco C, Silengo L, Oti M, Provero P, Di Cunto F. Prediction of human disease genes by human-mouse conserved coexpression analysis. PLoS Comput Biol. 2008;4(3):e1000043. doi: 10.1371/journal.pcbi.1000043. - DOI - PMC - PubMed
    1. Guan Y, Gorenshteyn D, Burmeister M, Wong AK, Schimenti JC, Handel MA, Bult CJ, Hibbs MA, Troyanskaya OG. Tissue-specific functional networks for prioritizing phenotype and disease genes. PLoS Comput Biol. 2012;8(9):e1002694. doi: 10.1371/journal.pcbi.1002694. - DOI - PMC - PubMed
    1. Singh-Blom UM, Natarajan N, Tewari A, Woods JO, Dhillon IS, Marcotte EM. Prediction and validation of gene-disease associations using methods inspired by social network analyses. PLoS One. 2013;8(5):e58977. doi: 10.1371/journal.pone.0058977. - DOI - PMC - PubMed
    1. Robinson PN, Mundlos S. The human phenotype ontology. Clin Genet. 2010;77(6):525–534. doi: 10.1111/j.1399-0004.2010.01436.x. - DOI - PubMed

LinkOut - more resources