Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jun 15;43(12):853-860.
doi: 10.1097/BRS.0000000000002442.

Examining the Ability of Artificial Neural Networks Machine Learning Models to Accurately Predict Complications Following Posterior Lumbar Spine Fusion

Affiliations

Examining the Ability of Artificial Neural Networks Machine Learning Models to Accurately Predict Complications Following Posterior Lumbar Spine Fusion

Jun S Kim et al. Spine (Phila Pa 1976). .

Abstract

Study design: A cross-sectional database study.

Objective: The aim of this study was to train and validate machine learning models to identify risk factors for complications following posterior lumbar spine fusion.

Summary of background data: Machine learning models such as artificial neural networks (ANNs) are valuable tools for analyzing and interpreting large and complex datasets. ANNs have yet to be used for risk factor analysis in orthopedic surgery.

Methods: The American College of Surgeons National Surgical Quality Improvement Program (ACS-NSQIP) database was queried for patients who underwent posterior lumbar spine fusion. This query returned 22,629 patients, 70% of whom were used to train our models, and 30% were used to evaluate the models. The predictive variables used included sex, age, ethnicity, diabetes, smoking, steroid use, coagulopathy, functional status, American Society for Anesthesiology (ASA) class ≥3, body mass index (BMI), pulmonary comorbidities, and cardiac comorbidities. The models were used to predict cardiac complications, wound complications, venous thromboembolism (VTE), and mortality. Using ASA class as a benchmark for prediction, area under receiver operating curves (AUC) was used to determine the accuracy of our machine learning models.

Results: On the basis of AUC values, ANN and LR both outperformed ASA class for predicting all four types of complications. ANN was the most accurate for predicting cardiac complications, and LR was most accurate for predicting wound complications, VTE, and mortality, though ANN and LR had comparable AUC values for predicting all types of complications. ANN had greater sensitivity than LR for detecting wound complications and mortality.

Conclusion: Machine learning in the form of logistic regression and ANNs were more accurate than benchmark ASA scores for identifying risk factors of developing complications following posterior lumbar spine fusion, suggesting they are potentially great tools for risk factor analysis in spine surgery.

Level of evidence: 3.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) Schematic of study workflow. (B) Diagram of ANN model. Bar lengths represent number of patient cases. ADASYN increases the number of positive cases to combat class imbalance. Negative cases are then partitioned in a 1:1 ratio with the positive cases to create a class-balanced dataset used for ANN training. Each partition trains an independent neural net. During evaluation, data are fed through each neural net where the responses are surveyed, weighted by the model’s accuracy, and the net prediction is used.
Figure 2.
Figure 2.
Coefficient weights obtained from logistic regression analysis used for feature selection. Dark cells indicate highly weighted features indicating a strong predictive value, and lighter cells indicate weakly weighted features.
Figure 3.
Figure 3.
Receiver operating curves for ASA, LR, and ANN for each complication type.
Figure 4.
Figure 4.
Heatmap of AUC values from LR, ANN, and ASA when predicting cardiac complications (cardiac), VTE, wound complications (wound), and mortality.
Figure 5.
Figure 5.
Confusion matrices of trained ANN and LR machine learners evaluated on hold-out (A) mortality and (B) wound complication data sets to demonstrate real-world performance.

References

    1. Mobbs RJ, Phan K, Malham G, et al. Lumbar interbody fusion: techniques, indications and comparison of interbody fusion options including PLIF, TLIF, MI-TLIF, OLIF/ATP, LLIF and ALIF. J Spine Surg 2015;1:2–18. - PMC - PubMed
    1. Rompe JD, Eysel P, Hopf C. Clinical efficacy of pedicle instrumentation and posterolateral fusion in the symptomatic degenerative lumbar spine. Eur Spine J 1995;4:231–7. - PubMed
    1. Kumar MN, Jacquot F, Hall H. Long-term follow-up of functional outcomes and radiographic changes at adjacent levels following lumbar spine fusion for degenerative disc disease. Eur Spine J 2001;10:309–13. - PMC - PubMed
    1. White SF, Asher MA, Lai SM, et al. Patients’ perceptions of overall function, pain, and appearance after primary posterior instrumentation and fusion for idiopathic scoliosis. Spine (Phila Pa 1976) 1996;24:1699–700. - PubMed
    1. Lowe TG, Tahernia AD, O’Brien MF, et al. Unilateral transforaminal posterior lumbar interbody fusion (TLIF): indications, technique, and 2-year results. J Spinal Disord Tech 2002;15:31–8. - PubMed