Comparison of diagnostic performance of a deep learning algorithm, emergency physicians, junior radiologists and senior radiologists in the detection of appendicular fractures in children
- PMID: 36877239
- DOI: 10.1007/s00247-023-05621-w
Comparison of diagnostic performance of a deep learning algorithm, emergency physicians, junior radiologists and senior radiologists in the detection of appendicular fractures in children
Abstract
Background: Advances have been made in the use of artificial intelligence (AI) in the field of diagnostic imaging, particularly in the detection of fractures on conventional radiographs. Studies looking at the detection of fractures in the pediatric population are few. The anatomical variations and evolution according to the child's age require specific studies of this population. Failure to diagnose fractures early in children may lead to serious consequences for growth.
Objective: To evaluate the performance of an AI algorithm based on deep neural networks toward detecting traumatic appendicular fractures in a pediatric population. To compare sensitivity, specificity, positive predictive value and negative predictive value of different readers and the AI algorithm.
Materials and methods: This retrospective study conducted on 878 patients younger than 18 years of age evaluated conventional radiographs obtained after recent non-life-threatening trauma. All radiographs of the shoulder, arm, elbow, forearm, wrist, hand, leg, knee, ankle and foot were evaluated. The diagnostic performance of a consensus of radiology experts in pediatric imaging (reference standard) was compared with those of pediatric radiologists, emergency physicians, senior residents and junior residents. The predictions made by the AI algorithm and the annotations made by the different physicians were compared.
Results: The algorithm predicted 174 fractures out of 182, corresponding to a sensitivity of 95.6%, a specificity of 91.64% and a negative predictive value of 98.76%. The AI predictions were close to that of pediatric radiologists (sensitivity 98.35%) and that of senior residents (95.05%) and were above those of emergency physicians (81.87%) and junior residents (90.1%). The algorithm identified 3 (1.6%) fractures not initially seen by pediatric radiologists.
Conclusion: This study suggests that deep learning algorithms can be useful in improving the detection of fractures in children.
Keywords: Artificial intelligence; Children; Convolutional neural networks; Deep learning; Emergency medicine, Fracture; Radiography.
© 2023. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
Similar articles
-
Assessment of an artificial intelligence aid for the detection of appendicular skeletal fractures in children and young adults by senior and junior radiologists.Pediatr Radiol. 2022 Oct;52(11):2215-2226. doi: 10.1007/s00247-022-05496-3. Epub 2022 Sep 28. Pediatr Radiol. 2022. PMID: 36169667
-
Diagnostic performance of an AI algorithm for the detection of appendicular bone fractures in pediatric patients.Eur J Radiol. 2024 Sep;178:111637. doi: 10.1016/j.ejrad.2024.111637. Epub 2024 Jul 17. Eur J Radiol. 2024. PMID: 39053306
-
Detecting pediatric wrist fractures using deep-learning-based object detection.Pediatr Radiol. 2023 May;53(6):1125-1134. doi: 10.1007/s00247-023-05588-8. Epub 2023 Jan 18. Pediatr Radiol. 2023. PMID: 36650360
-
Artificial Intelligence (AI) for Fracture Diagnosis: An Overview of Current Products and Considerations for Clinical Adoption, From the AJR Special Series on AI Applications.AJR Am J Roentgenol. 2022 Dec;219(6):869-878. doi: 10.2214/AJR.22.27873. Epub 2022 Jun 22. AJR Am J Roentgenol. 2022. PMID: 35731103 Review.
-
What Are the Applications and Limitations of Artificial Intelligence for Fracture Detection and Classification in Orthopaedic Trauma Imaging? A Systematic Review.Clin Orthop Relat Res. 2019 Nov;477(11):2482-2491. doi: 10.1097/CORR.0000000000000848. Clin Orthop Relat Res. 2019. PMID: 31283727 Free PMC article.
Cited by
-
Artificial intelligence in fracture detection on radiographs: a literature review.Jpn J Radiol. 2025 Apr;43(4):551-585. doi: 10.1007/s11604-024-01702-4. Epub 2024 Nov 14. Jpn J Radiol. 2025. PMID: 39538068 Review.
-
Pediatric surgical trainees and artificial intelligence: a comparative analysis of DeepSeek, Copilot, Google Bard and pediatric surgeons' performance on the European Pediatric Surgical In-Training Examinations (EPSITE).Pediatr Surg Int. 2025 Aug 8;41(1):247. doi: 10.1007/s00383-025-06104-9. Pediatr Surg Int. 2025. PMID: 40781215 Free PMC article.
-
Impact of test set composition on AI performance in pediatric wrist fracture detection in X-rays.Eur Radiol. 2025 May 16. doi: 10.1007/s00330-025-11669-z. Online ahead of print. Eur Radiol. 2025. PMID: 40379941
-
Artificial intelligence (AI) in radiological paediatric fracture assessment: an updated systematic review.Eur Radiol. 2025 Sep;35(9):5264-5286. doi: 10.1007/s00330-025-11449-9. Epub 2025 Mar 10. Eur Radiol. 2025. PMID: 40063108
-
Comparisons among radiologist, MR findings and radiomics-clinical models in predicting placenta accreta spectrum disorders: a multicenter study.Arch Gynecol Obstet. 2025 Jun;311(6):1751-1764. doi: 10.1007/s00404-025-07960-5. Epub 2025 Jan 30. Arch Gynecol Obstet. 2025. PMID: 39883136 Free PMC article.
References
-
- He K, Zhang X, Ren S, Sun J (2015) Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. In: 2015 IEEE International Conference on Computer Vision (ICCV) pp 1026–1034
MeSH terms
LinkOut - more resources
Full Text Sources
Medical