BMC Bioinformatics. 2006 Nov 24;7 Suppl 3(Suppl 3):S6.
doi: 10.1186/1471-2105-7-S3-S6.

Mapping data elements to terminological resources for integrating biomedical data sources

Fleur Mougin et al. BMC Bioinformatics.

Abstract

Background: Data integration is a crucial task in the biomedical domain, and integrating data sources is one approach to it. Data elements (DEs) in particular play an important role in data integration. We combine schema- and instance-based approaches to map DEs to terminological resources in order to facilitate the integration of data sources.

Methods: We extracted DEs from eleven disparate biomedical sources. We compared these DEs to concepts and/or terms in biomedical controlled vocabularies and to reference DEs. We also exploited DE values to disambiguate underspecified DEs and to identify additional mappings.
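
The combination of name-based (schema-level) and value-based (instance-level) mapping described above can be illustrated with a minimal Python sketch. It is not the authors' procedure: the toy vocabulary, its identifiers, the value lookup table, the modifier list, and all function names are hypothetical, and only the DE name "Approved Symbol" and the gene symbols TNXB, HFE, and BRCA1 come from the Figure 1 caption.

```python
import re
from typing import Optional, Sequence

# Hypothetical toy vocabulary: normalized term -> made-up concept identifier.
VOCABULARY = {"gene symbol": "T001", "gene name": "T002", "chromosome": "T003"}

# Hypothetical lookup giving a semantic type for known DE values.
VALUE_TYPES = {"tnxb": "gene", "hfe": "gene", "brca1": "gene"}

# Hypothetical modifiers dropped from DE names before lookup.
MODIFIERS = {"approved"}


def normalize(name: str) -> str:
    """Lowercase a DE name, split it into tokens, and drop modifiers."""
    tokens = re.findall(r"[a-z0-9]+", name.lower())
    return " ".join(t for t in tokens if t not in MODIFIERS)


def type_of_values(values: Sequence[str]) -> Optional[str]:
    """Return the semantic type shared by all values, or None if there is none."""
    types = {VALUE_TYPES.get(v.lower()) for v in values}
    return types.pop() if len(types) == 1 else None


def map_data_element(name: str, values: Sequence[str]) -> Optional[str]:
    """Try the DE name first, then use the DE values to disambiguate it."""
    key = normalize(name)
    if key in VOCABULARY:  # schema-level match on the name alone
        return VOCABULARY[key]
    value_type = type_of_values(values)  # instance-level evidence
    if value_type is not None:
        return VOCABULARY.get(f"{value_type} {key}")
    return None
```

In this sketch the underspecified name "Approved Symbol" matches nothing on its own; it resolves to the hypothetical "gene symbol" entry only when all of its values are typed as genes.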

Results: 82.5% of the 474 DEs studied were mapped to entries of a terminological resource, and 74.7% of the whole set could be associated with reference DEs. Only 6.6% of the DEs had values that could be semantically typed.
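
For orientation, the reported percentages can be converted into approximate DE counts. The short Python sketch below assumes that each percentage is taken over the full set of 474 DEs; the abstract does not state the counts, so the results are rounded estimates only, and the labels and function name are illustrative.

```python
TOTAL_DES = 474  # number of DEs studied, as stated in the abstract

# Percentages reported in the abstract; the short labels are illustrative.
REPORTED_PERCENT = {
    "mapped to a terminological resource": 82.5,
    "associated with reference DEs": 74.7,
    "values semantically typed": 6.6,
}


def implied_count(percent: float, total: int = TOTAL_DES) -> int:
    """Approximate count implied by a percentage of the total (rounded)."""
    return round(percent * total / 100)


for label, percent in REPORTED_PERCENT.items():
    print(f"{label}: about {implied_count(percent)} of {TOTAL_DES} DEs")
```

Under that assumption, the figures correspond to roughly 391, 354, and 31 DEs respectively.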

Conclusion: Our study suggests that the integration of biomedical sources can be achieved automatically, though with limited precision, and is largely facilitated by mapping DEs to terminological resources.

Figures

Figure 1
Example of the three Genew Web pages for the TNXB, HFE, and BRCA1 genes. Examples of data elements are encircled (Approved Symbol, Approved Name).
Figure 2
Examples of the exploitation of the values of two data elements: (a) using the UMLS as a terminological resource, (b) using heuristics.
