Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Meta-Analysis
. 2021 May 1;273(5):900-908.
doi: 10.1097/SLA.0000000000004419.

Natural Language Processing in Surgery: A Systematic Review and Meta-analysis

Affiliations
Meta-Analysis

Natural Language Processing in Surgery: A Systematic Review and Meta-analysis

Joseph A Mellia et al. Ann Surg. .

Abstract

Objective: The aim of this study was to systematically assess the application and potential benefits of natural language processing (NLP) in surgical outcomes research.

Summary background data: Widespread implementation of electronic health records (EHRs) has generated a massive patient data source. Traditional methods of data capture, such as billing codes and/or manual review of free-text narratives in EHRs, are highly labor-intensive, costly, subjective, and potentially prone to bias.

Methods: A literature search of PubMed, MEDLINE, Web of Science, and Embase identified all articles published starting in 2000 that used NLP models to assess perioperative surgical outcomes. Evaluation metrics of NLP systems were assessed by means of pooled analysis and meta-analysis. Qualitative synthesis was carried out to assess the results and risk of bias on outcomes.

Results: The present study included 29 articles, with over half (n = 15) published after 2018. The most common outcome identified using NLP was postoperative complications (n = 14). Compared to traditional non-NLP models, NLP models identified postoperative complications with higher sensitivity [0.92 (0.87-0.95) vs 0.58 (0.33-0.79), P < 0.001]. The specificities were comparable at 0.99 (0.96-1.00) and 0.98 (0.95-0.99), respectively. Using summary of likelihood ratio matrices, traditional non-NLP models have clinical utility for confirming documentation of outcomes/diagnoses, whereas NLP models may be reliably utilized for both confirming and ruling out documentation of outcomes/diagnoses.

Conclusions: NLP usage to extract a range of surgical outcomes, particularly postoperative complications, is accelerating across disciplines and areas of clinical outcomes research. NLP and traditional non-NLP approaches demonstrate similar performance measures, but NLP is superior in ruling out documentation of surgical outcomes.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest and Source of Funding: J.P.F. has received payments as a consultant from Baxter, Becton-Dickinson, Gore, and Integra LifeSciences. This research did not receive financial support for the study. The remaining authors do not have any financial disclosures. The authors report no conflicts of interest.

Comment in

Similar articles

Cited by

References

    1. Bates M. Models of natural language understanding. Proc Natl Acad Sci 1995; 92:9977–9982.
    1. Henry J, Pylypchuk Y, Searcy T, et al. Adoption of electronic health record systems among US non-federal acute care hospitals: 2008–2015. ONC Data Brief 2016; 35:1–9.
    1. Esteva A, Robicquet A, Ramsundar B, et al. A guide to deep learning in healthcare. Nat Med 2019; 25:24–29.
    1. Matt V, Matthew H. The retrospective chart review: important methodological considerations. J Educ Eval Health Prof 2013; 10:
    1. Nadkarni PM, Ohno-Machado L, Chapman WW. Natural language processing: an introduction. J Am Med Inform Assoc 2011; 18:544–551.