Methodology and development of a machine learning probability calculator: Data heterogeneity limits ability to predict recurrence after arthroscopic Bankart repair
- PMID: 39324357
- PMCID: PMC11948171
- DOI: 10.1002/ksa.12443
Methodology and development of a machine learning probability calculator: Data heterogeneity limits ability to predict recurrence after arthroscopic Bankart repair
Abstract
Purpose: The aim of this study was to develop and train a machine learning (ML) algorithm to create a clinical decision support tool (i.e., ML-driven probability calculator) to be used in clinical practice to estimate recurrence rates following an arthroscopic Bankart repair (ABR).
Methods: Data from 14 previously published studies were collected. Inclusion criteria were (1) patients treated with ABR without remplissage for traumatic anterior shoulder instability and (2) a minimum of 2 years follow-up. Risk factors associated with recurrence were identified using bivariate logistic regression analysis. Subsequently, four ML algorithms were developed and internally validated. The predictive performance was assessed using discrimination, calibration and the Brier score.
Results: In total, 5591 patients underwent ABR with a recurrence rate of 15.4% (n = 862). Age <35 years, participation in contact and collision sports, bony Bankart lesions and full-thickness rotator cuff tears increased the risk of recurrence (all p < 0.05). A single shoulder dislocation (compared to multiple dislocations) lowered the risk of recurrence (p < 0.05). Due to the unavailability of certain variables in some patients, a portion of the patient data had to be excluded before pooling the data set to create the algorithm. A total of 797 patients were included providing information on risk factors associated with recurrence. The discrimination (area under the receiver operating curve) ranged between 0.54 and 0.57 for prediction of recurrence.
Conclusion: ML was not able to predict the recurrence following ABR with the current available predictors. Despite a global coordinated effort, the heterogeneity of clinical data limited the predictive capabilities of the algorithm, emphasizing the need for standardized data collection methods in future studies.
Level of evidence: Level IV, retrospective cohort study.
Keywords: arthroscopic Bankart repair; artificial intelligence; dislocation; machine learning algorithm; recurrence; shoulder instability.
© 2024 The Author(s). Knee Surgery, Sports Traumatology, Arthroscopy published by John Wiley & Sons Ltd on behalf of European Society of Sports Traumatology, Knee Surgery and Arthroscopy.
Conflict of interest statement
Dr. Laurens J. H. Allaart reports personal fees from Stryker, ConMed, Exatech, PercisionOS, Parvizi Surgical Innovation and Reach Orthopedics and has a patent pending to Exatech, ConMed and Stryker. Dr. Geert Alexander Buijze reports grants from SECEC, Vivalto Sante, personal fees from Depuy‐Synthes. Dr. Michel P. J. van den Bekerom reports grants for clinical and research fellowships supported by Smith and Nephew. Dr. Laurent Lafosse reports personal fees from Stryker, Smith and Nephew, Depuy. Dr. Thibault Lafosse reports personal fees from Stryker, Smith and Nephew, Depuy. The other authors have nothing to disclose. None of the fees above were related to the current study.
References
-
- Alkaduhimi, H. , Connelly, J.W. , van Deurzen, D.F.P. , Eygendaal, D. & van den Bekerom, M.P.J. (2021) High variability of the definition of recurrent glenohumeral instability: an analysis of the current literature by a systematic review. Arthroscopy, Sports Medicine, and Rehabilitation, 3, e951–e966. Available from: 10.1016/j.asmr.2021.02.002 - DOI - PMC - PubMed
-
- Austin, P.C. & Steyerberg, E.W. (2017) Events per variable (EPV) and the relative performance of different strategies for estimating the out‐of‐sample validity of logistic regression models. Statistical Methods in Medical Research, 26, 796–808. Available from: 10.1177/0962280214558972 - DOI - PMC - PubMed
-
- Breiman, L. (2001) Random forests. Machine Learning, 45, 5–32. Available from: 10.1023/A:1010933404324 - DOI
-
- Collins, G.S. , Moons, K.G.M. , Dhiman, P. , Riley, R.D. , Beam, A.L. , Van Calster, B. , et al. (2024) TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ, 385, e078378. Available from: 10.1136/bmj-2023-078378 - DOI - PMC - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Research Materials
