Biological data integration: wrapping data and tools
- PMID: 12075666
- DOI: 10.1109/titb.2002.1006299
Biological data integration: wrapping data and tools
Abstract
Nowadays scientific data is inevitably digital and stored in a wide variety of formats in heterogeneous systems. Scientists need to access an integrated view of remote or local heterogeneous data sources with advanced data accessing, analyzing, and visualization tools. Building a digital library for scientific data requires accessing and manipulating data extracted from flat files or databases, documents retrieved from the Web as well as data generated by software. We present an approach to wrapping web data sources, databases, flat files, or data generated by tools through a database view mechanism. Generally, a wrapper has two tasks: it first sends a query to the source to retrieve data and, second builds the expected output with respect to the virtual structure. Our wrappers are composed of a retrieval component based on an intermediate object view mechanism called search views mapping the source capabilities to attributes, and an eXtensible Markup Language (XML) engine, respectively, to perform these two tasks. The originality of the approach consists of: 1) a generic view mechanism to access seamlessly data sources with limited capabilities and 2) the ability to wrap data sources as well as the useful specific tools they may provide. Our approach has been developed and demonstrated as part of the multidatabase system supporting queries via uniform object protocol model (OPM) interfaces.
Similar articles
-
Techniques for optimization of queries on integrated biological resources.J Bioinform Comput Biol. 2004 Jun;2(2):375-411. doi: 10.1142/s0219720004000648. J Bioinform Comput Biol. 2004. PMID: 15297988 Review.
-
Architecture of a mediator for a bioinformatics database federation.IEEE Trans Inf Technol Biomed. 2002 Jun;6(2):116-22. doi: 10.1109/titb.2002.1006298. IEEE Trans Inf Technol Biomed. 2002. PMID: 12075665
-
Building a bioinformatics ontology using OIL.IEEE Trans Inf Technol Biomed. 2002 Jun;6(2):135-41. doi: 10.1109/titb.2002.1006301. IEEE Trans Inf Technol Biomed. 2002. PMID: 12075668
-
Advanced query mechanisms for biological databases.Proc Int Conf Intell Syst Mol Biol. 1998;6:43-51. Proc Int Conf Intell Syst Mol Biol. 1998. PMID: 9783208
-
Automation of in-silico data analysis processes through workflow management systems.Brief Bioinform. 2008 Jan;9(1):57-68. doi: 10.1093/bib/bbm056. Epub 2007 Dec 2. Brief Bioinform. 2008. PMID: 18056132 Review.
Cited by
-
Trends in meta-analysis of genetic association studies.J Hum Genet. 2008;53(1):1-9. doi: 10.1007/s10038-007-0223-5. Epub 2007 Dec 12. J Hum Genet. 2008. PMID: 18071627
-
Dynamic integration of biological data sources using the data concierge.Health Inf Sci Syst. 2013 Feb 4;1:7. doi: 10.1186/2047-2501-1-7. eCollection 2013. Health Inf Sci Syst. 2013. PMID: 25825659 Free PMC article.
-
A novel framework for horizontal and vertical data integration in cancer studies with application to survival time prediction models.Biol Direct. 2019 Nov 21;14(1):22. doi: 10.1186/s13062-019-0249-6. Biol Direct. 2019. PMID: 31752974 Free PMC article.
-
An XML standard for the dissemination of annotated 2D gel electrophoresis data complemented with mass spectrometry results.BMC Bioinformatics. 2004 Jan 29;5:9. doi: 10.1186/1471-2105-5-9. BMC Bioinformatics. 2004. PMID: 15005801 Free PMC article.
MeSH terms
LinkOut - more resources
Full Text Sources