Osteoporosis Feature Selection and Risk Prediction Model by Machine Learning Using a Cross-Sectional Database
- PMID: 37718904
- PMCID: PMC10509024
- DOI: 10.11005/jbm.2023.30.3.263
Osteoporosis Feature Selection and Risk Prediction Model by Machine Learning Using a Cross-Sectional Database
Abstract
Background: The purpose of this study was to verify the accuracy and validity of using machine learning (ML) to select risk factors, to discriminate differences in feature selection by ML between men and women, and to develop predictive models for patients with osteoporosis in a big database.
Methods: The data on 968 observed features from a total of 3,484 the Korea National Health and Nutrition Examination Survey participants were collected. To find preliminary features that were well-related to osteoporosis, logistic regression, random forest, gradient boosting, adaptive boosting, and support vector machine were used.
Results: In osteoporosis feature selection by 5 ML models in this study, the most selected variables as risk factors in men and women were body mass index, monthly alcohol consumption, and dietary surveys. However, differences between men and women in osteoporosis feature selection by ML models were age, smoking, and blood glucose level. The receiver operating characteristic (ROC) analysis revealed that the area under the ROC curve for each ML model was not significantly different for either gender.
Conclusions: ML performed a feature selection of osteoporosis, considering hidden differences between men and women. The present study considers the preprocessing of input data and the feature selection process as well as the ML technique to be important factors for the accuracy of the osteoporosis prediction model.
Keywords: Machine learning; Osteoporosis; Risk assessment; Risk factors.
Conflict of interest statement
No potential conflict of interest relevant to this article was reported.
Figures




Similar articles
-
Application of machine learning approaches for osteoporosis risk prediction in postmenopausal women.Arch Osteoporos. 2020 Oct 23;15(1):169. doi: 10.1007/s11657-020-00802-8. Arch Osteoporos. 2020. PMID: 33097976
-
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251. Clin Orthop Relat Res. 2020. PMID: 32282466 Free PMC article.
-
Sarcopenia feature selection and risk prediction using machine learning: A cross-sectional study.Medicine (Baltimore). 2019 Oct;98(43):e17699. doi: 10.1097/MD.0000000000017699. Medicine (Baltimore). 2019. PMID: 31651901 Free PMC article.
-
Osteoporosis Pre-Screening Using Ensemble Machine Learning in Postmenopausal Korean Women.Healthcare (Basel). 2022 Jun 14;10(6):1107. doi: 10.3390/healthcare10061107. Healthcare (Basel). 2022. PMID: 35742158 Free PMC article.
-
Application of Machine Learning to Identify Clinically Meaningful Risk Group for Osteoporosis in Individuals Under the Recommended Age for Dual-Energy X-Ray Absorptiometry.Calcif Tissue Int. 2021 Dec;109(6):645-655. doi: 10.1007/s00223-021-00880-x. Epub 2021 Jun 30. Calcif Tissue Int. 2021. PMID: 34195852
Cited by
-
Relationship between albumin-corrected anion gap and lumbar spine bone mineral density: a cross-sectional study.Front Aging. 2025 Feb 11;6:1511294. doi: 10.3389/fragi.2025.1511294. eCollection 2025. Front Aging. 2025. PMID: 40007853 Free PMC article.