Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 27;13(7):1251.
doi: 10.3390/diagnostics13071251.

Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Affiliations

Accurate and Reliable Classification of Unstructured Reports on Their Diagnostic Goal Using BERT Models

Max Tigo Rietberg et al. Diagnostics (Basel). .

Abstract

Understanding the diagnostic goal of medical reports is valuable information for understanding patient flows. This work focuses on extracting the reason for taking an MRI scan of Multiple Sclerosis (MS) patients using the attached free-form reports: Diagnosis, Progression or Monitoring. We investigate the performance of domain-dependent and general state-of-the-art language models and their alignment with domain expertise. To this end, eXplainable Artificial Intelligence (XAI) techniques are used to acquire insight into the inner workings of the model, which are verified on their trustworthiness. The verified XAI explanations are then compared with explanations from a domain expert, to indirectly determine the reliability of the model. BERTje, a Dutch Bidirectional Encoder Representations from Transformers (BERT) model, outperforms RobBERT and MedRoBERTa.nl in both accuracy and reliability. The latter model (MedRoBERTa.nl) is a domain-specific model, while BERTje is a generic model, showing that domain-specific models are not always superior. Our validation of BERTje in a small prospective study shows promising results for the potential uptake of the model in a practical setting.

Keywords: BERT; health informatics; natural language processing; text classification.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Overview of our study setup. In the retrospective study (right), we train and test models on data collected retrospectively. The models are evaluated with leave-one-out cross-validation. Three standard XAI feature importance techniques are applied to the trained models, and their resulting feature importance is verified by a domain expert. The explanations are additionally validated w.r.t. to their fidelity to the model they explain. The setup and results of the prospective study are reported in Section 5.4.
Figure 2
Figure 2
Confusion matrices for the three BERT models.

References

    1. Centraal Bureau voor de Statistiek . Zorguitgaven; Kerncijfers. Centraal Bureau voor de Statistiek; Hague, The Netherland: 2022.
    1. Langlotz C.P. Structured Radiology Reporting: Are We There Yet? Radiology. 2009;253:23–25. doi: 10.1148/radiol.2531091088. - DOI - PubMed
    1. Ashfaq H.A., Lester C.A., Ballouz D., Errickson J., Woodward M.A. Medication Accuracy in Electronic Health Records for Microbial Keratitis. JAMA Ophthalmol. 2019;137:929–931. doi: 10.1001/jamaophthalmol.2019.1444. - DOI - PMC - PubMed
    1. Hernandez-Boussard T., Tamang S., Blayney D., Brooks J., Shah N. New Paradigms for Patient-Centered Outcomes Research in Electronic Medical Records. eGEMs. 2016;4:1231. doi: 10.13063/2327-9214.1231. - DOI - PMC - PubMed
    1. Payne T.H., Zhao L.P., Le C., Wilcox P., Yi T., Hinshaw J., Hussey D., Kostrinsky-Thomas A., Hale M., Brimm J., et al. Electronic health records contain dispersed risk factor information that could be used to prevent breast and ovarian cancer. J. Am. Med Inform. Assoc. JAMIA. 2020;27:1443–1449. doi: 10.1093/jamia/ocaa152. - DOI - PMC - PubMed