Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 26:18:4219-4232.
doi: 10.2147/JMDH.S519195. eCollection 2025.

Random Forest Regression May Become the Optimal Regression Model for Osteoarthritis of the Knee in Elderly, in the Context of Embodied Cognition and Psychosomatic Medicine

Affiliations

Random Forest Regression May Become the Optimal Regression Model for Osteoarthritis of the Knee in Elderly, in the Context of Embodied Cognition and Psychosomatic Medicine

Guangyuan Ma et al. J Multidiscip Healthc. .

Abstract

Background: In the context of embodied cognition and psychosomatic medicine, predicting post-treatment depression in elderly patients with knee osteoarthritis (KOA) is critical for improving psychological outcomes. While regression analysis is widely used in longitudinal medical studies, the optimal model for forecasting complex psychosomatic changes remains unclear.

Objective: This study compared the predictive performance of five regression models in estimating post-treatment depression (D2) among elderly KOA patients, considering variables such as gender, age, pain, anxiety, sleep quality, and baseline depression.

Methods: A total of 106 elderly KOA patients from the Affiliated Hospital of Southwest Medical University were assessed before and after treatment (September 2023 to February 2024). Psychological and physical metrics included the Visual Analog Scale (VAS), Beck Anxiety Inventory (BAI), Geriatric Depression Scale (GDS), and Pittsburgh Sleep Quality Index (PSQI). Five regression techniques-non-negative linear regression, stochastic gradient descent (SGD), AdaBoost, Random Forest, and Gradient Boosting Decision Trees (GBDT)-were evaluated using R², mean squared error (MSE), and mean absolute error (MAE). Bootstrap resampling and the Kruskal-Wallis test were applied to ensure robustness and compare model coefficients.

Results: Random Forest regression achieved the highest performance (R² = 0.687, MSE = 0.589, MAE = 0.785), followed by AdaBoost. Post-treatment anxiety and sleep quality emerged as the strongest predictors. All models showed acceptable multicollinearity (VIF < 10), and Kruskal-Wallis results suggested no significant differences in coefficients across models.

Conclusion: Random forest regression outperformed other models in predicting depression after KOA treatment, demonstrating its strength in capturing complex nonlinear relationships. However, the study's relatively small sample size and predominantly female cohort may limit generalizability. Future research with larger and more diverse samples is recommended to validate these findings.

Keywords: elderly patients; knee osteoarthritis; longitudinal analysis; post-treatment depression; psychosomatic outcomes; random forest; regression models.

PubMed Disclaimer

Conflict of interest statement

The authors have no relevant financial or non-financial interests to disclose for this work.

Figures

Figure 1
Figure 1
Comparison of distributions at two time points.
Figure 2
Figure 2
Correlation heatmap. **Correlation is significant at the 0.01 level (two-tailed). *Correlation is significant at the 0.05 level (two-tailed). P1 is pre-treatment pain; D1 is pre-treatment depression; A1 is pre-treatment anxiety; S1 is pre-treatment sleep; P2 is post-treatment pain; D2 is post-treatment depression; A2 post-treatment anxiety; S2 is post-treatment sleep.
Figure 3
Figure 3
Comparison of R-squared, MSE, and MAE values for five regression models.
Figure 4
Figure 4
Comparison of R-squared, MSE, and MAE confidence intervals for five regression models.

Similar articles

References

    1. Cleophas T, Zwinderman A. Regression Analysis in Medical Research, for Starters and 2nd Levelers; New York: Springer;2021. doi: 10.1007/978-3-030-61394-5 - DOI
    1. West SG, Aiken LS, Cham H, Liu Y. 227Multiple Regression: the Basics and Beyond for Clinical Scientists. In: Comer JS, Kendall PC editors. The Oxford Handbook of Research Strategies for Clinical Psychology. Oxford University Press;2013. doi: 10.1093/oxfordhb/9780199793549.013.0013 - DOI
    1. Wu W, Selig JP, Little TD. 387Longitudinal Data Analysis. In: Little TD editor. The Oxford Handbook of Quantitative Methods in Psychology: Vol. 2: Statistical Analysis. Oxford University Press; 2013. doi: 10.1093/oxfordhb/9780199934898.013.0018 - DOI
    1. Ravani P, Barrett B, Parfrey P. Modeling Longitudinal Data, II: standard Regression Models and Extensions. In: Barrett B, Parfrey P, editors. Clinical Epidemiology: Practice and Methods. Totowa, NJ: Humana Press; 2009:61–94. doi: 10.1007/978-1-59745-385-1_4 - DOI - PubMed
    1. Minqiang Z, Ping F. Psychological Statistics. The ECPH Encyclopedia of Psychology. Singapore:Springer Nature Singapore; 2024:1–5. doi: 10.1007/978-981-99-6000-2_492-1 - DOI

LinkOut - more resources