Improving traumatic fracture detection on radiographs with artificial intelligence support: a multi-reader study
- PMID: 38757067
- PMCID: PMC11096271
- DOI: 10.1093/bjro/tzae011
Improving traumatic fracture detection on radiographs with artificial intelligence support: a multi-reader study
Abstract
Objectives: The aim of this study was to evaluate the diagnostic performance of nonspecialist readers with and without the use of an artificial intelligence (AI) support tool to detect traumatic fractures on radiographs of the appendicular skeleton.
Methods: The design was a retrospective, fully crossed multi-reader, multi-case study on a balanced dataset of patients (≥2 years of age) with an AI tool as a diagnostic intervention. Fifteen readers assessed 340 radiographic exams, with and without the AI tool in 2 different sessions and the time spent was automatically recorded. Reference standard was established by 3 consultant radiologists. Sensitivity, specificity, and false positives per patient were calculated.
Results: Patient-wise sensitivity increased from 72% to 80% (P < .05) and patient-wise specificity increased from 81% to 85% (P < .05) in exams aided by the AI tool compared to the unaided exams. The increase in sensitivity resulted in a relative reduction of missed fractures of 29%. The average rate of false positives per patient decreased from 0.16 to 0.14, corresponding to a relative reduction of 21%. There was no significant difference in average reading time spent per exam. The largest gain in fracture detection performance, with AI support, across all readers, was on nonobvious fractures with a significant increase in sensitivity of 11 percentage points (pp) (60%-71%).
Conclusions: The diagnostic performance for detection of traumatic fractures on radiographs of the appendicular skeleton improved among nonspecialist readers tested AI fracture detection support tool showed an overall reader improvement in sensitivity and specificity when supported by an AI tool. Improvement was seen in both sensitivity and specificity without negatively affecting the interpretation time.
Advances in knowledge: The division and analysis of obvious and nonobvious fractures are novel in AI reader comparison studies like this.
Keywords: artificial intelligence; diagnostic performance; fracture detection; multi-reader study.
© The Author(s) 2024. Published by Oxford University Press on behalf of the British Institute of Radiology.
Conflict of interest statement
R.B., M.L., P.L., and A.N. are employees of Radiobotics. M.B. is a member of the clinical advisory board of Radiobotics.
Figures
References
-
- Court-Brown CM, Caesar B.. Epidemiology of adult fractures: a review. Injury. 2006;37(8):691-697. - PubMed
-
- Shibuya N, Davis ML, Jupiter DC.. Epidemiology of foot and ankle fractures in the United States: an analysis of the National Trauma Data Bank (2007 to 2011). J Foot Ankle Surg. 2014;2953(5):606-608. - PubMed
-
- Petinaux B, Bhat R, Boniface K, Aristizabal J.. Accuracy of radiographic readings in the emergency department. Am J Emerg Med. 2011;29(1):18-25. - PubMed
LinkOut - more resources
Full Text Sources
Miscellaneous
