Shapely additive values can effectively visualize pertinent covariates in machine learning when predicting hypertension
- PMID: 37971610
- PMCID: PMC10710553
- DOI: 10.1111/jch.14745
Shapely additive values can effectively visualize pertinent covariates in machine learning when predicting hypertension
Abstract
Machine learning methods are widely used within the medical field to enhance prediction. However, little is known about the reliability and efficacy of these models to predict long-term medical outcomes such as blood pressure using lifestyle factors, such as diet. The authors assessed whether machine-learning techniques could accurately predict hypertension risk using nutritional information. A cross-sectional study using data from the National Health and Nutrition Examination Survey (NHANES) between January 2017 and March 2020. XGBoost was used as the machine-learning model of choice in this study due to its increased performance relative to other common methods within medical studies. Model prediction metrics (e.g., AUROC, Balanced Accuracy) were used to measure overall model efficacy, covariate Gain statistics (percentage each covariate contributes to the overall prediction) and SHapely Additive exPlanations (SHAP, method to visualize each covariate) were used to provide explanations to machine-learning output and increase the transparency of this otherwise cryptic method. Of a total of 9650 eligible patients, the mean age was 41.02 (SD = 22.16), 4792 (50%) males, 4858 (50%) female, 3407 (35%) White patients, 2567 (27%) Black patients, 2108 (22%) Hispanic patients, and 981 (10%) Asian patients. From evaluation of model gain statistics, age was found to be the single strongest predictor of hypertension, with a gain of 53.1%. Additionally, demographic factors such as poverty and Black race were also strong predictors of hypertension, with gain of 4.33% and 4.18%, respectively. Nutritional Covariates contributed 37% to the overall prediction: Sodium, Caffeine, Potassium, and Alcohol intake being significantly represented within the model. Machine Learning can be used to predict hypertension.
Keywords: SHAP; cardiology; hypertension; machine learning; model transparency; statistics.
© 2023 The Authors. The Journal of Clinical Hypertension published by Wiley Periodicals LLC.
Conflict of interest statement
The authors have no financial or personal conflicts of interest.
Figures





Similar articles
-
Increasing transparency in machine learning through bootstrap simulation and shapely additive explanations.PLoS One. 2023 Feb 23;18(2):e0281922. doi: 10.1371/journal.pone.0281922. eCollection 2023. PLoS One. 2023. PMID: 36821544 Free PMC article.
-
Use of machine learning to identify risk factors for insomnia.PLoS One. 2023 Apr 12;18(4):e0282622. doi: 10.1371/journal.pone.0282622. eCollection 2023. PLoS One. 2023. PMID: 37043435 Free PMC article.
-
Use of machine learning to identify risk factors for coronary artery disease.PLoS One. 2023 Apr 14;18(4):e0284103. doi: 10.1371/journal.pone.0284103. eCollection 2023. PLoS One. 2023. PMID: 37058460 Free PMC article.
-
Exploring Depression and Nutritional Covariates Amongst US Adults using Shapely Additive Explanations.Health Sci Rep. 2023 Oct 20;6(10):e1635. doi: 10.1002/hsr2.1635. eCollection 2023 Oct. Health Sci Rep. 2023. PMID: 37867784 Free PMC article.
-
Interpretable machine learning model to predict surgical difficulty in laparoscopic resection for rectal cancer.Front Oncol. 2024 Feb 6;14:1337219. doi: 10.3389/fonc.2024.1337219. eCollection 2024. Front Oncol. 2024. PMID: 38380369 Free PMC article. Review.
Cited by
-
Association of dietary calcium intake with chronic bronchitis and emphysema.J Health Popul Nutr. 2025 Apr 2;44(1):102. doi: 10.1186/s41043-025-00843-0. J Health Popul Nutr. 2025. PMID: 40176178 Free PMC article.
-
Machine learning analysis of emerging risk factors for early-onset hypertension in the Tlalpan 2020 cohort.Front Cardiovasc Med. 2025 Jan 17;11:1434418. doi: 10.3389/fcvm.2024.1434418. eCollection 2024. Front Cardiovasc Med. 2025. PMID: 39896047 Free PMC article.
-
Negative association between body roundness index and constipation: insights from NHANES.J Health Popul Nutr. 2025 May 9;44(1):149. doi: 10.1186/s41043-025-00886-3. J Health Popul Nutr. 2025. PMID: 40346717 Free PMC article.
-
Evaluation of the responsiveness pattern to caffeine through a smart data-driven ECG non-linear multi-band analysis.Heliyon. 2024 May 23;10(11):e31721. doi: 10.1016/j.heliyon.2024.e31721. eCollection 2024 Jun 15. Heliyon. 2024. PMID: 38867964 Free PMC article.
-
Use machine learning models to identify and assess risk factors for coronary artery disease.PLoS One. 2024 Sep 6;19(9):e0307952. doi: 10.1371/journal.pone.0307952. eCollection 2024. PLoS One. 2024. PMID: 39240939 Free PMC article.
References
-
- Dobrowolski P, Prejbisz A, Kurylowicz A, et al. Metabolic syndrome—a new definition and management guidelines: a joint position paper by the Polish Society of Hypertension, Polish Society for the Treatment of Obesity, Polish Lipid Association, Polish Association for Study of Liver, Polish Society of Family Medicine, Polish Society of Lifestyle Medicine, Division of Prevention and Epidemiology Polish Cardiac Society, “Club 30” Polish Cardiac Society, and Division of Metabolic and Bariatric Surgery Society of Polish Surgeons. Arch Med Sci. 2022;18(5):1133‐1156. - PMC - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical