Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 May-Jun;13(3):334-43.
doi: 10.1197/jamia.M1823. Epub 2006 Feb 24.

A system for automated lexical mapping

Affiliations

A system for automated lexical mapping

Jennifer Y Sun et al. J Am Med Inform Assoc. 2006 May-Jun.

Abstract

Objective: To automate the mapping of disparate databases to standardized medical vocabularies.

Background: Merging of clinical systems and medical databases, or aggregation of information from disparate databases, frequently requires a process whereby vocabularies are compared and similar concepts are mapped.

Design: Using a normalization phase followed by a novel alignment stage inspired by DNA sequence alignment methods, automated lexical mapping can map terms from various databases to standard vocabularies such as the UMLS (Unified Medical Language System) and LOINC (Logical Observation Identifier Names and Codes).

Measurements: This automated lexical mapping was evaluated using three real-world laboratory databases from different health care institutions. The authors report the sensitivity, specificity, percentage correct (true positives plus true negatives divided by total number of terms), and true positive and true negative rates as measures of system performance.

Results: The alignment algorithm was able to map 57% to 78% (average of 63% over all runs and databases) of equivalent concepts through lexical mapping alone. True positive rates ranged from 18% to 70%; true negative rates ranged from 5% to 52%.

Conclusion: Lexical mapping can facilitate the integration of data from diverse sources and decrease the time and cost required for manual mapping and integration of clinical systems and medical databases.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
An overall flowchart detailing the steps of the alignment process.
Figure 2.
Figure 2.
Matrix representation of alignment. Query term is alk ptase (on the vertical) and dictionary term is alkaline phosphatase (on the horizontal). Highlighted cells represent positions where a character of the query term matches a character from the dictionary term within the optimal alignment.
Figure 3.
Figure 3.
A graphical representation of how the chain is linked together. The dark solid arrows of the matrix show the reading of the cells from lower left and up the column. Each column is read from bottom to top, proceeding from the leftmost column to the right. The lower image shows the chain representation of the matrix. Coordinates represent the position of the cell in the matrix. The cells marked “1” are read from the matrix starting from the lower left corner, up the first column (column 0 in ▶), then proceeding up the second column (column 1) and continuing through the table from left to right.

Similar articles

Cited by

References

    1. Rocha RA, Rocha BHSC, Huff SM. Automated translation between medical vocabularies using a frame-based interlingua. Proc Annu Symp Comput Appl Med Care. 1993:690–4. - PMC - PubMed
    1. Rocha RA, Huff SM. Using diagrams to map controlled medical vocabularies. Proc Annu Symp Comput Appl Med Care. 1994:172–6. - PMC - PubMed
    1. Dolin RH, Huff SM, Rocha RA, Spackman KA, Campbell KE. Evaluation of a “lexically assign, logically refine” strategy for semi-automated integration of overlapping terminologies. J Am Med Inform Assoc. 1998;5:203–13. - PMC - PubMed
    1. Elkin PL, Cimino JJ, Lowe HJ, Aronow DB, Payne TH, Pincetl PS, Barnett GO. Mapping to MeSH: the art of trapping MeSH equivalence from within narrative text. Proc 12th SCAMC. 1988:185–90.
    1. Barrows RC, Cimino JJ, Claton PD. Mapping clinically useful terminology to a controlled medical vocabulary. Proc Annu Symp Comput Appl Med Care. 1994:211–5. - PMC - PubMed

MeSH terms