Scaling the walls of discovery: using semantic metadata for integrative problem solving
- PMID: 19304872
- DOI: 10.1093/bib/bbp007
Scaling the walls of discovery: using semantic metadata for integrative problem solving
Abstract
Current data integration approaches by bioinformaticians frequently involve extracting data from a wide variety of public and private data repositories, each with a unique vocabulary and schema, via scripts. These separate data sets must then be normalized through the tedious and lengthy process of resolving naming differences and collecting information into a single view. Attempts to consolidate such diverse data using data warehouses or federated queries add significant complexity and have shown limitations in flexibility. The alternative of complete semantic integration of data requires a massive, sustained effort in mapping data types and maintaining ontologies. We focused instead on creating a data architecture that leverages semantic mapping of experimental metadata, to support the rapid prototyping of scientific discovery applications with the twin goals of reducing architectural complexity while still leveraging semantic technologies to provide flexibility, efficiency and more fully characterized data relationships. A metadata ontology was developed to describe our discovery process. A metadata repository was then created by mapping metadata from existing data sources into this ontology, generating RDF triples to describe the entities. Finally an interface to the repository was designed which provided not only search and browse capabilities but complex query templates that aggregate data from both RDF and RDBMS sources. We describe how this approach (i) allows scientists to discover and link relevant data across diverse data sources and (ii) provides a platform for development of integrative informatics applications.
Similar articles
-
YeastHub: a semantic web use case for integrating data in the life sciences domain.Bioinformatics. 2005 Jun;21 Suppl 1:i85-96. doi: 10.1093/bioinformatics/bti1026. Bioinformatics. 2005. PMID: 15961502
-
Biological knowledge management: the emerging role of the Semantic Web technologies.Brief Bioinform. 2009 Jul;10(4):392-407. doi: 10.1093/bib/bbp024. Epub 2009 May 19. Brief Bioinform. 2009. PMID: 19457869 Review.
-
Linked data and provenance in biological data webs.Brief Bioinform. 2009 Mar;10(2):139-52. doi: 10.1093/bib/bbn044. Epub 2008 Dec 6. Brief Bioinform. 2009. PMID: 19060306
-
Towards a semantic medical Web: HealthCyberMap's tool for building an RDF metadata base of health information resources based on the Qualified Dublin Core Metadata Set.Med Sci Monit. 2002 Jul;8(7):MT124-36. Med Sci Monit. 2002. PMID: 12118210
-
DOORS to the semantic web and grid with a PORTAL for biomedical computing.IEEE Trans Inf Technol Biomed. 2008 Mar;12(2):191-204. doi: 10.1109/TITB.2007.905861. IEEE Trans Inf Technol Biomed. 2008. PMID: 18348949 Review.
Cited by
-
Informatics in radiology: an information model of the DICOM standard.Radiographics. 2011 Jan-Feb;31(1):295-304. doi: 10.1148/rg.311105085. Epub 2010 Oct 27. Radiographics. 2011. PMID: 20980665 Free PMC article.
-
A Semantic-Based Approach for Managing Healthcare Big Data: A Survey.J Healthc Eng. 2020 Nov 23;2020:8865808. doi: 10.1155/2020/8865808. eCollection 2020. J Healthc Eng. 2020. PMID: 33489061 Free PMC article. Review.
-
Network-based drug discovery by integrating systems biology and computational technologies.Brief Bioinform. 2013 Jul;14(4):491-505. doi: 10.1093/bib/bbs043. Epub 2012 Aug 9. Brief Bioinform. 2013. PMID: 22877768 Free PMC article.
-
Electrocorticographic mapping of expressive language function without requiring the patient to speak: A report of three cases.Epilepsy Behav Case Rep. 2016 Mar 9;6:13-8. doi: 10.1016/j.ebcr.2016.02.002. eCollection 2016. Epilepsy Behav Case Rep. 2016. PMID: 27408803 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources