Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Mar 1;25(3):331-336.
doi: 10.1093/jamia/ocx132.

CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines

Affiliations

CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines

Ergin Soysal et al. J Am Med Inform Assoc. .

Abstract

Existing general clinical natural language processing (NLP) systems such as MetaMap and Clinical Text Analysis and Knowledge Extraction System have been successfully applied to information extraction from clinical text. However, end users often have to customize existing systems for their individual tasks, which can require substantial NLP skills. Here we present CLAMP (Clinical Language Annotation, Modeling, and Processing), a newly developed clinical NLP toolkit that provides not only state-of-the-art NLP components, but also a user-friendly graphic user interface that can help users quickly build customized NLP pipelines for their individual applications. Our evaluation shows that the CLAMP default pipeline achieved good performance on named entity recognition and concept encoding. We also demonstrate the efficiency of the CLAMP graphic user interface in building customized, high-performance NLP pipelines with 2 use cases, extracting smoking status and lab test values. CLAMP is publicly available for research use, and we believe it is a unique asset for the clinical NLP community.

Keywords: clinical text processing; machine learning; natural language processing.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The user interface for building a pipeline in CLAMP.
Figure 2.
Figure 2.
The interface in CLAMP for annotating entities and relations.
Figure 3.
Figure 3.
The interface for selecting features and evaluation options for building machine learning–based NER models using CLAMP.

References

    1. Demner-Fushman D,Chapman WW,McDonald CJ. What can natural language processing do for clinical decision support? J Biomed Inform. 2009;425:760–72. - PMC - PubMed
    1. Savova GK,Masanz JJ,Ogren PV,et al.Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.J Am Med Inform Assoc. 2010;175:507–13. - PMC - PubMed
    1. Aronson AR,Lang F-M. An overview of MetaMap: historical perspective and recent advances.J Am Med Inform Assoc. 2010;173:229–36. - PMC - PubMed
    1. Demner-Fushman D,Rogers WJ,Aronson AR. MetaMap Lite: an evaluation of a new Java implementation of MetaMap.J Am Med Inform Assoc. 2017;244:841–44. - PMC - PubMed
    1. Friedman C. Towards a comprehensive medical language processing system: methods and issues.Proc AMIA Annu Fall Symp. 1997:595–99. - PMC - PubMed