Using machine learning to detect sarcopenia from electronic health records
- PMID: 37654711
- PMCID: PMC10467215
- DOI: 10.1177/20552076231197098
Using machine learning to detect sarcopenia from electronic health records
Abstract
Introduction: Sarcopenia (low muscle mass and strength) causes dysmobility and loss of independence. Sarcopenia is often not directly coded or described in electronic health records (EHR). The objective was to improve sarcopenia detection using structured data from EHR.
Methods: Adults undergoing musculoskeletal testing (December 2017-March 2020) were classified as meeting sarcopenia thresholds for 0 (controls), ≥1 (Sarcopenia-1), or ≥2 (Sarcopenia-2) tests. Electronic health record diagnoses, medications, and laboratory testing were extracted from the Indiana Network for Patient Care. Five machine learning models were applied to EHR data for predicting sarcopenia.
Results: Of 1304 participants, 1055 were controls, 249 met Sarcopenia-1 and 76 met Sarcopenia-2. Sarcopenic participants were older, with higher fat mass, Charlson Comorbidity Index, and more chronic diseases. All models performed better for Sarcopenia-2 than Sarcopenia-1. The top performing models for Sarcopenia-1 were Logistic Regression [area under the curve (AUC) 71.59 (95% confidence interval [CI], 71.51-71.66)] and Multi-Layer Perceptron [AUC 71.48 (95%CI, 71.00-71.97)]. The top performing models for Sarcopenia-2 were Logistic Regression [AUC 91.44 (95%CI, 91.28-91.60)] and Support Vector Machine [AUC 90.81 (95%CI, 88.41-93.20)]. For the best Logistic Regression Model, important sarcopenia predictors included diabetes mellitus, digestive system complaints, signs and symptoms involving the nervous, musculoskeletal and respiratory systems, metabolic disorders, and kidney or urinary tract disorders. Opioids, corticosteroids, and antihyperlipidemic drugs were also more common among sarcopenic participants.
Conclusions: Applying machine learning models, sarcopenia can be predicted from structured data in EHR, which may be developed through future studies to facilitate large-scale early detection and intervention in clinical populations.
Keywords: Sarcopenia; health informatics; machine learning; musculoskeletal.
© The Author(s) 2023.
Figures

Similar articles
-
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.J Biomed Inform. 2016 Apr;60:162-8. doi: 10.1016/j.jbi.2015.12.006. Epub 2015 Dec 17. J Biomed Inform. 2016. PMID: 26707455 Free PMC article.
-
Detecting rare diseases in electronic health records using machine learning and knowledge engineering: Case study of acute hepatic porphyria.PLoS One. 2020 Jul 2;15(7):e0235574. doi: 10.1371/journal.pone.0235574. eCollection 2020. PLoS One. 2020. PMID: 32614911 Free PMC article.
-
Sarcopenia, frailty and cachexia patients detected in a multisystem electronic health record database.BMC Musculoskelet Disord. 2020 Jul 31;21(1):508. doi: 10.1186/s12891-020-03522-9. BMC Musculoskelet Disord. 2020. PMID: 32736613 Free PMC article.
-
A novel electronic health record-based, machine-learning model to predict severe hypoglycemia leading to hospitalizations in older adults with diabetes: A territory-wide cohort and modeling study.PLoS Med. 2024 Apr 12;21(4):e1004369. doi: 10.1371/journal.pmed.1004369. eCollection 2024 Apr. PLoS Med. 2024. PMID: 38607977 Free PMC article.
-
Adult patient access to electronic health records.Cochrane Database Syst Rev. 2021 Feb 26;2(2):CD012707. doi: 10.1002/14651858.CD012707.pub2. Cochrane Database Syst Rev. 2021. PMID: 33634854 Free PMC article.
Cited by
-
Development of a visualized risk prediction system for sarcopenia in older adults using machine learning: a cohort study based on CHARLS.Front Public Health. 2025 Mar 12;13:1544894. doi: 10.3389/fpubh.2025.1544894. eCollection 2025. Front Public Health. 2025. PMID: 40144970 Free PMC article.
-
A machine learning-based online web calculator to aid in the diagnosis of sarcopenia in the US community.Digit Health. 2024 Sep 27;10:20552076241283247. doi: 10.1177/20552076241283247. eCollection 2024 Jan-Dec. Digit Health. 2024. PMID: 39360239 Free PMC article.
-
Sarcopenia prediction model based on machine learning and SHAP values for community-based older adults with cardiovascular disease in China.Front Public Health. 2025 May 21;13:1527304. doi: 10.3389/fpubh.2025.1527304. eCollection 2025. Front Public Health. 2025. PMID: 40469611 Free PMC article.
-
Height estimation in children and adolescents using body composition big data: Machine-learning and explainable artificial intelligence approach.Digit Health. 2025 Mar 28;11:20552076251331879. doi: 10.1177/20552076251331879. eCollection 2025 Jan-Dec. Digit Health. 2025. PMID: 40162169 Free PMC article.
-
Predictive modeling of lean body mass, appendicular lean mass, and appendicular skeletal muscle mass using machine learning techniques: A comprehensive analysis utilizing NHANES data and the Look AHEAD study.PLoS One. 2024 Sep 6;19(9):e0309830. doi: 10.1371/journal.pone.0309830. eCollection 2024. PLoS One. 2024. PMID: 39240958 Free PMC article.
References
-
- Sandberg C, Johansson K, Christersson C, et al. Sarcopenia is common in adults with complex congenital heart disease. Int J Cardiol 2019; 296: 57–62. - PubMed
-
- Silva TLD, Mulder AP. Sarcopenia and poor muscle quality associated with severe obesity in young adults and middle-aged adults. Clin Nutr ESPEN 2021; 45: 299–305. - PubMed
-
- Rolland Y, Abellan van Kan G, Gillette-Guyonnet Set al. et al. Cachexia versus sarcopenia. Curr Opin Clin Nutr Metab Care 2011; 14: 15–21. - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous