Deep learning predicts hip fracture using confounding patient and healthcare variables
- PMID: 31304378
- PMCID: PMC6550136
- DOI: 10.1038/s41746-019-0105-1
Deep learning predicts hip fracture using confounding patient and healthcare variables
Abstract
Hip fractures are a leading cause of death and disability among older adults. Hip fractures are also the most commonly missed diagnosis on pelvic radiographs, and delayed diagnosis leads to higher cost and worse outcomes. Computer-aided diagnosis (CAD) algorithms have shown promise for helping radiologists detect fractures, but the image features underpinning their predictions are notoriously difficult to understand. In this study, we trained deep-learning models on 17,587 radiographs to classify fracture, 5 patient traits, and 14 hospital process variables. All 20 variables could be individually predicted from a radiograph, with the best performances on scanner model (AUC = 1.00), scanner brand (AUC = 0.98), and whether the order was marked "priority" (AUC = 0.79). Fracture was predicted moderately well from the image (AUC = 0.78) and better when combining image features with patient data (AUC = 0.86, DeLong paired AUC comparison, p = 2e-9) or patient data plus hospital process features (AUC = 0.91, p = 1e-21). Fracture prediction on a test set that balanced fracture risk across patient variables was significantly lower than a random test set (AUC = 0.67, DeLong unpaired AUC comparison, p = 0.003); and on a test set with fracture risk balanced across patient and hospital process variables, the model performed randomly (AUC = 0.52, 95% CI 0.46-0.58), indicating that these variables were the main source of the model's fracture predictions. A single model that directly combines image features, patient, and hospital process data outperforms a Naive Bayes ensemble of an image-only model prediction, patient, and hospital process data. If CAD algorithms are inexplicably leveraging patient and process variables in their predictions, it is unclear how radiologists should interpret their predictions in the context of other known patient data. Further research is needed to illuminate deep-learning decision processes so that computers and clinicians can effectively cooperate.
Keywords: Computer science; Radiography; Statistics.
Conflict of interest statement
Competing interestsM.A.B. has received consulting fees from Whiteboard Coordinator, Inc. J.T.D. has received consulting fees or honoraria from Janssen Pharmaceuticals, GlaxoSmithKline, AstraZeneca, and Hoffman-La Roche. J.T.D. is a scientific advisor to L.A.M. Therapeutics and holds equity in NuMedii, Ayasdi, and Ontomics. M.L., M.V.M., and T.M.S. are employees of Verily Life Sciences.
Figures




Similar articles
-
Is Deep Learning On Par with Human Observers for Detection of Radiographically Visible and Occult Fractures of the Scaphoid?Clin Orthop Relat Res. 2020 Nov;478(11):2653-2659. doi: 10.1097/CORR.0000000000001318. Clin Orthop Relat Res. 2020. PMID: 32452927 Free PMC article.
-
Comparison of Chest Radiograph Interpretations by Artificial Intelligence Algorithm vs Radiology Residents.JAMA Netw Open. 2020 Oct 1;3(10):e2022779. doi: 10.1001/jamanetworkopen.2020.22779. JAMA Netw Open. 2020. PMID: 33034642 Free PMC article.
-
Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study.PLoS Med. 2018 Nov 6;15(11):e1002683. doi: 10.1371/journal.pmed.1002683. eCollection 2018 Nov. PLoS Med. 2018. PMID: 30399157 Free PMC article.
-
Artificial Intelligence for Hip Fracture Detection and Outcome Prediction: A Systematic Review and Meta-analysis.JAMA Netw Open. 2023 Mar 1;6(3):e233391. doi: 10.1001/jamanetworkopen.2023.3391. JAMA Netw Open. 2023. PMID: 36930153 Free PMC article.
-
Artificial intelligence and machine learning on diagnosis and classification of hip fracture: systematic review.J Orthop Surg Res. 2022 Dec 1;17(1):520. doi: 10.1186/s13018-022-03408-7. J Orthop Surg Res. 2022. PMID: 36456982 Free PMC article.
Cited by
-
Critical evaluation of deep neural networks for wrist fracture detection.Sci Rep. 2021 Mar 16;11(1):6006. doi: 10.1038/s41598-021-85570-2. Sci Rep. 2021. PMID: 33727668 Free PMC article.
-
Differential diagnosis of benign and malignant vertebral fracture on CT using deep learning.Eur Radiol. 2021 Dec;31(12):9612-9619. doi: 10.1007/s00330-021-08014-5. Epub 2021 May 16. Eur Radiol. 2021. PMID: 33993335 Free PMC article.
-
Artificial intelligence diagnostic model for multi-site fracture X-ray images of extremities based on deep convolutional neural networks.Quant Imaging Med Surg. 2024 Feb 1;14(2):1930-1943. doi: 10.21037/qims-23-878. Epub 2024 Jan 9. Quant Imaging Med Surg. 2024. PMID: 38415122 Free PMC article.
-
There is no such thing as a validated prediction model.BMC Med. 2023 Feb 24;21(1):70. doi: 10.1186/s12916-023-02779-w. BMC Med. 2023. PMID: 36829188 Free PMC article.
-
Automatic detection and classification of peri-prosthetic femur fracture.Int J Comput Assist Radiol Surg. 2022 Apr;17(4):649-660. doi: 10.1007/s11548-021-02552-5. Epub 2022 Feb 14. Int J Comput Assist Radiol Surg. 2022. PMID: 35157227 Free PMC article.
References
-
- Ward, R. J. et al. ACR Appropriateness Criteria® Acute HipPain—Suspected Fracture. https://acsearch.acr.org/docs/3082587/Narrative/ (2018).
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous