A new web-based data mining tool for the identification of candidate genes for human genetic disorders

Marc A van Driel¹, Koen Cuelenaere, Patrick P C W Kemmeren, Jack A M Leunissen, Han G Brunner

Affiliations

PMID: 12529706
DOI: 10.1038/sj.ejhg.5200918

A new web-based data mining tool for the identification of candidate genes for human genetic disorders

Marc A van Driel et al. Eur J Hum Genet. 2003 Jan.

. 2003 Jan;11(1):57-63.

doi: 10.1038/sj.ejhg.5200918.

Authors

Marc A van Driel¹, Koen Cuelenaere, Patrick P C W Kemmeren, Jack A M Leunissen, Han G Brunner

Affiliation

¹ Centre for Molecular and Biomolecular Informatics, University of Nijmegen, The Netherlands. M.vanDriel@cmbi.kun.nl

PMID: 12529706
DOI: 10.1038/sj.ejhg.5200918

Abstract

To identify the gene underlying a human genetic disorder can be difficult and time-consuming. Typically, positional data delimit a chromosomal region that contains between 20 and 200 genes. The choice then lies between sequencing large numbers of genes, or setting priorities by combining positional data with available expression and phenotype data, contained in different internet databases. This process of examining positional candidates for possible functional clues may be performed in many different ways, depending on the investigator's knowledge and experience. Here, we report on a new tool called the GeneSeeker, which gathers and combines positional data and expression/phenotypic data in an automated way from nine different web-based databases. This results in a quick overview of interesting candidate genes in the region of interest. The GeneSeeker system is built in a modular fashion allowing for easy addition or removal of databases if required. Databases are searched directly through the web, which obviates the need for data warehousing. In order to evaluate the GeneSeeker tool, we analysed syndromes with known genesis. For each of 10 syndromes the GeneSeeker programme generated a shortlist that contained a significantly reduced number of candidate genes from the critical region, yet still contained the causative gene. On average, a list of 163 genes based on position alone was reduced to a more manageable list of 22 genes based on position and expression or phenotype information. We are currently expanding the tool by adding other databases. The GeneSeeker is available via the web-interface (http://www.cmbi.kun.nl/GeneSeeker/).

PubMed Disclaimer

Cited by

Genome-wide identification of genes likely to be involved in human genetic disease.
López-Bigas N, Ouzounis CA. López-Bigas N, et al. Nucleic Acids Res. 2004 Jun 4;32(10):3108-14. doi: 10.1093/nar/gkh605. Print 2004. Nucleic Acids Res. 2004. PMID: 15181176 Free PMC article.
Gene prioritization of resistant rice gene against Xanthomas oryzae pv. oryzae by using text mining technologies.
Xia J, Zhang X, Yuan D, Chen L, Webster J, Fang AC. Xia J, et al. Biomed Res Int. 2013;2013:853043. doi: 10.1155/2013/853043. Epub 2013 Nov 25. Biomed Res Int. 2013. PMID: 24371834 Free PMC article.
Text mining in cancer gene and pathway prioritization.
Luo Y, Riedlinger G, Szolovits P. Luo Y, et al. Cancer Inform. 2014 Oct 13;13(Suppl 1):69-79. doi: 10.4137/CIN.S13874. eCollection 2014. Cancer Inform. 2014. PMID: 25392685 Free PMC article. Review.
Integration of text- and data-mining using ontologies successfully selects disease gene candidates.
Tiffin N, Kelso JF, Powell AR, Pan H, Bajic VB, Hide WA. Tiffin N, et al. Nucleic Acids Res. 2005 Mar 14;33(5):1544-52. doi: 10.1093/nar/gki296. Print 2005. Nucleic Acids Res. 2005. PMID: 15767279 Free PMC article.
POCUS: mining genomic sequence annotation to predict disease genes.
Turner FS, Clutterbuck DR, Semple CA. Turner FS, et al. Genome Biol. 2003;4(11):R75. doi: 10.1186/gb-2003-4-11-r75. Epub 2003 Oct 10. Genome Biol. 2003. PMID: 14611661 Free PMC article.

See all "Cited by" articles

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A new web-based data mining tool for the identification of candidate genes for human genetic disorders

Affiliation

A new web-based data mining tool for the identification of candidate genes for human genetic disorders

Authors

Affiliation

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical

Abstract

Similar articles

Cited by

Publication types

MeSH terms

Related information

LinkOut - more resources

Full Text Sources

Medical