Data integration and genomic medicine
- PMID: 16574494
- DOI: 10.1016/j.jbi.2006.02.007
Abstract
Genomic medicine aims to revolutionize health care by applying our growing understanding of the molecular basis of disease. Research in this arena is data intensive, which means data sets are large and highly heterogeneous. To create knowledge from data, researchers must integrate these large and diverse data sets. This presents daunting informatics challenges, such as representing data in a form suitable for computational inference (knowledge representation) and linking heterogeneous data sets (data integration). Fortunately, many of these challenges can be classified as data integration problems, and technologies exist in the area of data integration that may be applied to these challenges. In this paper, we discuss the opportunities of genomic medicine as well as identify the informatics challenges in this domain. We also review concepts and methodologies in the field of data integration. These data integration concepts and methodologies are then aligned with informatics challenges in genomic medicine and presented as potential solutions. We conclude this paper with challenges still not addressed in genomic medicine and gaps that remain in data integration research to facilitate genomic medicine.