Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2020 Sep:109:103526.
doi: 10.1016/j.jbi.2020.103526. Epub 2020 Aug 6.

Clinical concept extraction: A methodology review

Affiliations
Review

Clinical concept extraction: A methodology review

Sunyang Fu et al. J Biomed Inform. 2020 Sep.

Abstract

Background: Concept extraction, a subdomain of natural language processing (NLP) with a focus on extracting concepts of interest, has been adopted to computationally extract clinical information from text for a wide range of applications ranging from clinical decision support to care quality improvement.

Objectives: In this literature review, we provide a methodology review of clinical concept extraction, aiming to catalog development processes, available methods and tools, and specific considerations when developing clinical concept extraction applications.

Methods: Based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, a literature search was conducted for retrieving EHR-based information extraction articles written in English and published from January 2009 through June 2019 from Ovid MEDLINE In-Process & Other Non-Indexed Citations, Ovid MEDLINE, Ovid EMBASE, Scopus, Web of Science, and the ACM Digital Library.

Results: A total of 6,686 publications were retrieved. After title and abstract screening, 228 publications were selected. The methods used for developing clinical concept extraction applications were discussed in this review.

Keywords: Concept extraction; Deep learning; Electronic health records; Information extraction; Machine learning; Natural language processing.

PubMed Disclaimer

Conflict of interest statement

Competing Interests

The authors have no competing interests to declare.

Figures

Figure 1.
Figure 1.
An example of using concept extraction for stroke research.
Figure 2.
Figure 2.
Overview of article selection process.
Figure 3.
Figure 3.
Trend view of clinical concept extraction research over different approaches.
Figure 4.
Figure 4.
Overview of the method utilization.
Figure 5.
Figure 5.
The development process of clinical concept extraction applications.
Figure 6.
Figure 6.
Data collection and corpus annotation process.

References

    1. Jones SS, Rudin RS, Perry T, Shekelle PG. Health information technology: an updated systematic review with a focus on meaningful use. Ann Intern Med. 2014;160(1):48–54. - PubMed
    1. Friedman CP, Wong AK, Blumenthal D. Achieving a nationwide learning health system. Science translational medicine. 2010;2(57):57cm29–57cm29. - PubMed
    1. Demner-Fushman D, Chapman WW, McDonald CJ. What can natural language processing do for clinical decision support?. [Review] [132 refs]. Journal of Biomedical Informatics. 2009;42(5):760–72. - PMC - PubMed
    1. Cowie J, Wilks Y. Information extraction. Handbook of Natural Language Processing. 2000;56:57.
    1. Wang Y, Wang L, Rastegar-Mojarad M, Moon S, Shen F, Afzal N, et al. Clinical information extraction applications: A literature review. Journal of Biomedical Informatics. 2018;77:34–49. - PMC - PubMed

Publication types

LinkOut - more resources