Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

Kung-Hsun Weng¹, Chung-Feng Liu², Chia-Jung Chen³

Affiliations

¹ Department of Medical Imaging, Chi Mei Medical Center, Chiali, Tainan, Taiwan.
² Department of Medical Research, Chi Mei Medical Center, Tainan, Taiwan.
³ Department of Information Systems, Chi Mei Medical Center, Tainan, Taiwan.

PMID: 37097731
PMCID: PMC10170361
DOI: 10.2196/46348

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

Kung-Hsun Weng et al. JMIR Med Inform. 2023.

. 2023 Apr 25:11:e46348.

doi: 10.2196/46348.

Authors

Kung-Hsun Weng¹, Chung-Feng Liu², Chia-Jung Chen³

Affiliations

¹ Department of Medical Imaging, Chi Mei Medical Center, Chiali, Tainan, Taiwan.
² Department of Medical Research, Chi Mei Medical Center, Tainan, Taiwan.
³ Department of Information Systems, Chi Mei Medical Center, Tainan, Taiwan.

PMID: 37097731
PMCID: PMC10170361
DOI: 10.2196/46348

Abstract

Background: Negation and speculation unrelated to abnormal findings can lead to false-positive alarms for automatic radiology report highlighting or flagging by laboratory information systems.

Objective: This internal validation study evaluated the performance of natural language processing methods (NegEx, NegBio, NegBERT, and transformers).

Methods: We annotated all negative and speculative statements unrelated to abnormal findings in reports. In experiment 1, we fine-tuned several transformer models (ALBERT [A Lite Bidirectional Encoder Representations from Transformers], BERT [Bidirectional Encoder Representations from Transformers], DeBERTa [Decoding-Enhanced BERT With Disentangled Attention], DistilBERT [Distilled version of BERT], ELECTRA [Efficiently Learning an Encoder That Classifies Token Replacements Accurately], ERNIE [Enhanced Representation through Knowledge Integration], RoBERTa [Robustly Optimized BERT Pretraining Approach], SpanBERT, and XLNet) and compared their performance using precision, recall, accuracy, and F₁-scores. In experiment 2, we compared the best model from experiment 1 with 3 established negation and speculation-detection algorithms (NegEx, NegBio, and NegBERT).

Results: Our study collected 6000 radiology reports from 3 branches of the Chi Mei Hospital, covering multiple imaging modalities and body parts. A total of 15.01% (105,755/704,512) of words and 39.45% (4529/11,480) of important diagnostic keywords occurred in negative or speculative statements unrelated to abnormal findings. In experiment 1, all models achieved an accuracy of >0.98 and F₁-score of >0.90 on the test data set. ALBERT exhibited the best performance (accuracy=0.991; F₁-score=0.958). In experiment 2, ALBERT outperformed the optimized NegEx, NegBio, and NegBERT methods in terms of overall performance (accuracy=0.996; F₁-score=0.991), in the prediction of whether diagnostic keywords occur in speculative statements unrelated to abnormal findings, and in the improvement of the performance of keyword extraction (accuracy=0.996; F₁-score=0.997).

Conclusions: The ALBERT deep learning method showed the best performance. Our results represent a significant advancement in the clinical applications of computer-aided notification systems.

Keywords: BERT; Bidirectional Encoder Representations from Transformers; clinical application; deep learning; natural language processing; negation; radiology; radiology report; supervised learning; transfer learning; validation study.

©Kung-Hsun Weng, Chung-Feng Liu, Chia-Jung Chen. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 25.04.2023.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1**
Research flow. n: number of reports.

**Figure 2**
Experiment 1. X: the original text, ŷ: class predicted by the model; y: the gold standard. Category 0: positive statements or speculative statements potentially related to abnormal findings; category 1: negative statements or speculative statements unrelated to abnormal findings. ALBERT: A Lite Bidirectional Encoder Representations From Transformers; BERT: Bidirectional Encoder Representations From Transformers; DeBERTa: Decoding-Enhanced Bidirectional Encoder Representations From Transformers With Disentangled Attention; DistilBERT: Distilled version of Bidirectional Encoder Representations From Transformers; ELECTRA: Efficiently Learning an Encoder That Classifies Token Replacements Accurately; ERNIE: Enhanced Representation through Knowledge Integration; RoBERTa: Robustly Optimized Bidirectional Encoder Representations From Transformers Pretraining Approach; RUL: right upper lobe.

**Figure 3**
Experiment 2 Note. X: the original text; ŷ: class predicted by the model; y: the gold standard; category 0: positive statements or speculative statements potentially related to abnormal findings; category 1: negative statements or speculative statements unrelated to abnormal findings; bold text: word matching a designated “important keyword.” Exp: experiment.

See this image and copyright information in PMC

References

1. Lacson R, Prevedello LM, Andriole KP, O'Connor SD, Roy C, Gandhi T, Dalal AK, Sato L, Khorasani R. Four-year impact of an alert notification system on closed-loop communication of critical test results. AJR Am J Roentgenol. 2014 Dec;203(5):933–8. doi: 10.2214/AJR.14.13064. https://europepmc.org/abstract/MED/25341129 - DOI - PMC - PubMed
1. Ignácio FC, de Souza LR, D'Ippolito G, Garcia MM. Radiology report: what is the opinion of the referring physician? Radiol Bras. 2018 Sep;51(5):308–12. doi: 10.1590/0100-3984.2017.0115. https://europepmc.org/abstract/MED/30369658 - DOI - PMC - PubMed
1. Reda AS, Hashem DA, Khashoggi K, Abukhodair F. Clinicians' behavior toward radiology reports: a cross-sectional study. Cureus. 2020 Nov 05;12(11):e11336. doi: 10.7759/cureus.11336. https://europepmc.org/abstract/MED/33304672 - DOI - PMC - PubMed
1. European Society of Radiology (ESR) ESR guidelines for the communication of urgent and unexpected findings. Insights Imaging. 2012 Feb;3(1):1–3. doi: 10.1007/s13244-011-0135-y. https://europepmc.org/abstract/MED/22695992 - DOI - PMC - PubMed
1. Nakamura Y, Hanaoka S, Nomura Y, Nakao T, Miki S, Watadani T, Yoshikawa T, Hayashi N, Abe O. Automatic detection of actionable radiology reports using bidirectional encoder representations from transformers. BMC Med Inform Decis Mak. 2021 Sep 11;21(1):262. doi: 10.1186/s12911-021-01623-6. https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/s12911-0... 10.1186/s12911-021-01623-6 - DOI - DOI - PMC - PubMed

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

Affiliations

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources