Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Max Tigo Rietberg¹, Van Bach Nguyen², Jeroen Geerdink³, Onno Vijlbrief³, Christin Seifert²

Affiliations

¹ Faculty of EEMCS, University of Twente, 7500 AE Enschede, The Netherlands.
² Institute for Artificial Intelligence in Medicine, University of Duisburg-Essen, 45131 Essen, Germany.
³ Hospital Group Twente (ZGT), 7555 DL Hengelo, The Netherlands.

PMID: 37046469
PMCID: PMC10093295
DOI: 10.3390/diagnostics13071251

Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Max Tigo Rietberg et al. Diagnostics (Basel). 2023.

. 2023 Mar 27;13(7):1251.

doi: 10.3390/diagnostics13071251.

Authors

Max Tigo Rietberg¹, Van Bach Nguyen², Jeroen Geerdink³, Onno Vijlbrief³, Christin Seifert²

Affiliations

¹ Faculty of EEMCS, University of Twente, 7500 AE Enschede, The Netherlands.
² Institute for Artificial Intelligence in Medicine, University of Duisburg-Essen, 45131 Essen, Germany.
³ Hospital Group Twente (ZGT), 7555 DL Hengelo, The Netherlands.

PMID: 37046469
PMCID: PMC10093295
DOI: 10.3390/diagnostics13071251

Abstract

Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investigate the performance of domain-dependent and general state-of-the-art language models and their alignment with domain expertise. To this end, eXplainable Artificial Intelligence (XAI) techniques are used to acquire insight into the inner workings of the model, which are verified on their trustworthiness. The verified XAI explanations are then compared with explanations from a domain expert, to indirectly determine the reliability of the model. BERTje, a Dutch Bidirectional Encoder Representations from Transformers (BERT) model, outperforms RobBERT and MedRoBERTa.nl in both accuracy and reliability. The latter model (MedRoBERTa.nl) is a domain-specific model, while BERTje is a generic model, showing that domain-specific models are not always superior. Our validation of BERTje in a small prospective study shows promising results for the potential uptake of the model in a practical setting.

Keywords: BERT; health informatics; natural language processing; text classification.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
Overview of our study setup. In the retrospective study (right), we train and test models on data collected retrospectively. The models are evaluated with leave-one-out cross-validation. Three standard XAI feature importance techniques are applied to the trained models, and their resulting feature importance is verified by a domain expert. The explanations are additionally validated w.r.t. to their fidelity to the model they explain. The setup and results of the prospective study are reported in Section 5.4.

**Figure 2**
Confusion matrices for the three BERT models.

See this image and copyright information in PMC

References

1. Centraal Bureau voor de Statistiek . Zorguitgaven; Kerncijfers. Centraal Bureau voor de Statistiek; Hague, The Netherland: 2022.
1. Langlotz C.P. Structured Radiology Reporting: Are We There Yet? Radiology. 2009;253:23–25. doi: 10.1148/radiol.2531091088. - DOI - PubMed
1. Ashfaq H.A., Lester C.A., Ballouz D., Errickson J., Woodward M.A. Medication Accuracy in Electronic Health Records for Microbial Keratitis. JAMA Ophthalmol. 2019;137:929–931. doi: 10.1001/jamaophthalmol.2019.1444. - DOI - PMC - PubMed
1. Hernandez-Boussard T., Tamang S., Blayney D., Brooks J., Shah N. New Paradigms for Patient-Centered Outcomes Research in Electronic Medical Records. eGEMs. 2016;4:1231. doi: 10.13063/2327-9214.1231. - DOI - PMC - PubMed
1. Payne T.H., Zhao L.P., Le C., Wilcox P., Yi T., Hinshaw J., Hussey D., Kostrinsky-Thomas A., Hale M., Brimm J., et al. Electronic health records contain dispersed risk factor information that could be used to prevent breast and ovarian cancer. J. Am. Med Inform. Assoc. JAMIA. 2020;27:1443–1449. doi: 10.1093/jamia/ocaa152. - DOI - PMC - PubMed

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Affiliations

Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources