A similarity-based method for genome-wide prediction of disease-relevant human genes
- PMID: 12385992
- DOI: 10.1093/bioinformatics/18.suppl_2.s110
A similarity-based method for genome-wide prediction of disease-relevant human genes
Abstract
Motivation: A method for prediction of disease relevant human genes from the phenotypic appearance of a query disease is presented. Diseases of known genetic origin are clustered according to their phenotypic similarity. Each cluster entry consists of a disease and its underlying disease gene. Potential disease genes from the human genome are scored by their functional similarity to known disease genes in these clusters, which are phenotypically similar to the query disease.
Results: For assessment of the approach, a leave-one-out cross-validation of 878 diseases from the OMIM database, using 10672 candidate genes from the human genome, is performed. Depending on the applied parameters, in roughly one-third of cases the true solution is contained within the top scoring 3% of predictions and in two-third of cases the true solution is contained within the top scoring 15% of predictions. The prediction results can either be used to identify target genes, when searching for a mutation in monogenic diseases or for selection of loci in genotyping experiments in genetically complex diseases.
Similar articles
-
Prediction of human disease genes by human-mouse conserved coexpression analysis.PLoS Comput Biol. 2008 Mar 28;4(3):e1000043. doi: 10.1371/journal.pcbi.1000043. PLoS Comput Biol. 2008. PMID: 18369433 Free PMC article.
-
Association of genes to genetically inherited diseases using data mining.Nat Genet. 2002 Jul;31(3):316-9. doi: 10.1038/ng895. Epub 2002 May 13. Nat Genet. 2002. PMID: 12006977
-
Phenolyzer: phenotype-based prioritization of candidate genes for human diseases.Nat Methods. 2015 Sep;12(9):841-3. doi: 10.1038/nmeth.3484. Epub 2015 Jul 20. Nat Methods. 2015. PMID: 26192085 Free PMC article.
-
Common neurodegenerative diseases: dissection by genome-wide association.Curr Neurol Neurosci Rep. 2007 Sep;7(5):425-7. doi: 10.1007/s11910-007-0065-8. Curr Neurol Neurosci Rep. 2007. PMID: 17764633 Review.
-
Searching for candidate genes in the new millennium.Clin Exp Dermatol. 2001 May;26(3):279-83. doi: 10.1046/j.1365-2230.2001.00816.x. Clin Exp Dermatol. 2001. PMID: 11422176 Review.
Cited by
-
Pinpointing disease genes through phenomic and genomic data fusion.BMC Genomics. 2015;16 Suppl 2(Suppl 2):S3. doi: 10.1186/1471-2164-16-S2-S3. Epub 2015 Jan 21. BMC Genomics. 2015. PMID: 25708473 Free PMC article.
-
Evolving knowledge graph similarity for supervised learning in complex biomedical domains.BMC Bioinformatics. 2020 Jan 3;21(1):6. doi: 10.1186/s12859-019-3296-1. BMC Bioinformatics. 2020. PMID: 31900127 Free PMC article.
-
HESML: a real-time semantic measures library for the biomedical domain with a reproducible survey.BMC Bioinformatics. 2022 Jan 6;23(1):23. doi: 10.1186/s12859-021-04539-0. BMC Bioinformatics. 2022. PMID: 34991460 Free PMC article.
-
Hypothesis-driven candidate gene association studies: practical design and analytical considerations.Am J Epidemiol. 2009 Oct 15;170(8):986-93. doi: 10.1093/aje/kwp242. Epub 2009 Sep 17. Am J Epidemiol. 2009. PMID: 19762372 Free PMC article. Review.
-
Using genome-wide expression profiling to define gene networks relevant to the study of complex traits: from RNA integrity to network topology.Int Rev Neurobiol. 2012;104:91-133. doi: 10.1016/B978-0-12-398323-7.00005-7. Int Rev Neurobiol. 2012. PMID: 23195313 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical