Natural language processing to convert unstructured COVID-19 chest-CT reports into structured reports
- PMID: 37575311
- PMCID: PMC10413059
- DOI: 10.1016/j.ejro.2023.100512
Abstract
Background: Structured reporting has been shown to increase report completeness and reduce the error rate, and it also enables data mining of radiological reports. Still, radiologists perceive structured reporting as a fragmented reporting style that limits their freedom of expression.
Purpose: A deep learning-based natural language processing method was developed to automatically convert unstructured COVID-19 chest CT reports into structured reports.
Methods: Two hundred two COVID-19 chest CT examinations were retrospectively reviewed by two experienced radiologists, who wrote a free-form text radiological report for each exam and coherently filled in the template provided by the Italian Society of Medical and Interventional Radiology, which was used as ground truth. A semi-supervised convolutional neural network was implemented to extract 62 categorical variables from the report. Two iterations were carried out, the first without fine-tuning and the second with fine-tuning. Performance was measured using the mean accuracy and the mean F1 score. An error analysis was performed to identify errors entirely attributable to incorrect processing by the model.
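As an illustration of the two metrics named in the Methods, the following is a minimal sketch of how a mean accuracy and a mean F1 score over several categorical variables could be computed. The abstract does not state how the scores were averaged, so averaging per variable and using an unweighted (macro) F1 across categories are assumptions; the variable names and labels in the example are hypothetical and are not taken from the study data.

```python
def accuracy(y_true, y_pred):
    """Fraction of reports whose predicted category matches the ground truth."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-category F1 scores for one variable."""
    pairs = list(zip(y_true, y_pred))
    scores = []
    for c in set(y_true) | set(y_pred):
        tp = sum(t == c and p == c for t, p in pairs)
        fp = sum(t != c and p == c for t, p in pairs)
        fn = sum(t == c and p != c for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def mean_scores(truth, pred):
    """Average accuracy and macro F1 over all variables.

    truth, pred: dict mapping a variable name to a list of labels,
    one label per report, in the same report order.
    """
    names = list(truth)
    accs = [accuracy(truth[v], pred[v]) for v in names]
    f1s = [macro_f1(truth[v], pred[v]) for v in names]
    return sum(accs) / len(accs), sum(f1s) / len(f1s)


# Hypothetical toy data: two template variables, four reports.
truth = {
    "ground_glass": ["yes", "yes", "no", "no"],
    "pleural_effusion": ["no", "no", "no", "yes"],
}
pred = {
    "ground_glass": ["yes", "no", "no", "no"],
    "pleural_effusion": ["no", "no", "no", "yes"],
}

mean_acc, mean_f1 = mean_scores(truth, pred)
print(f"mean accuracy = {mean_acc:.3f}, mean F1 = {mean_f1:.3f}")
```

In the study's setting, `truth` would hold the radiologist-filled template values and `pred` the values extracted by the model for each of the 62 variables.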
Results: The algorithm achieved a mean accuracy of 93.7% and an F1 score of 93.8% in the first iteration; 46% of the errors were exclusively attributable to wrong inference. In the second iteration the model achieved 95.8% for both parameters, and the percentage of errors attributable to wrong inference decreased to 26%.
Conclusions: The convolutional neural network achieved optimal performance in the automated conversion of free-form text into structured radiological reports, overcoming the limitations attributed to structured reporting and paving the way for data mining of radiological reports.
Keywords: Artificial intelligence; COVID-19; Deep learning; Natural language processing; Structured reporting.
© 2023 The Authors. Published by Elsevier Ltd.
Conflict of interest statement
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Giovanni Ferrando, Claudio Bedini, Sandro Ubbiali and Salvatore Valentino declare personal fees from Ebit s.r.l. Esaote group. The other authors of this manuscript declare no conflict of interest.