Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method
- PMID: 34631636
- PMCID: PMC8497705
- DOI: 10.3389/fpubh.2021.619429
Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method
Abstract
Hypertension is a widespread chronic disease. Risk prediction of hypertension is an intervention that contributes to the early prevention and management of hypertension. The implementation of such intervention requires an effective and easy-to-implement hypertension risk prediction model. This study evaluated and compared the performance of four machine learning algorithms on predicting the risk of hypertension based on easy-to-collect risk factors. A dataset of 29,700 samples collected through a physical examination was used for model training and testing. Firstly, we identified easy-to-collect risk factors of hypertension, through univariate logistic regression analysis. Then, based on the selected features, 10-fold cross-validation was utilized to optimize four models, random forest (RF), CatBoost, MLP neural network and logistic regression (LR), to find the best hyper-parameters on the training set. Finally, the performance of models was evaluated by AUC, accuracy, sensitivity and specificity on the test set. The experimental results showed that the RF model outperformed the other three models, and achieved an AUC of 0.92, an accuracy of 0.82, a sensitivity of 0.83 and a specificity of 0.81. In addition, Body Mass Index (BMI), age, family history and waist circumference (WC) are the four primary risk factors of hypertension. These findings reveal that it is feasible to use machine learning algorithms, especially RF, to predict hypertension risk without clinical or genetic data. The technique can provide a non-invasive and economical way for the prevention and management of hypertension in a large population.
Keywords: easy-to-collect; hypertension; lifestyle; machine learning method; risk prediction.
Copyright © 2021 Zhao, Zhang, Xu, Gao, Ma, Sun and Wang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
Similar articles
-
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25. Artif Intell Med. 2019. PMID: 31521253
-
Evaluating the risk of hypertension in residents in primary care in Shanghai, China with machine learning algorithms.Front Public Health. 2022 Oct 4;10:984621. doi: 10.3389/fpubh.2022.984621. eCollection 2022. Front Public Health. 2022. PMID: 36267989 Free PMC article.
-
Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data.J Med Internet Res. 2019 Feb 15;21(2):e11757. doi: 10.2196/11757. J Med Internet Res. 2019. PMID: 30767907 Free PMC article.
-
Predicting post-stroke pneumonia using deep neural network approaches.Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1. Int J Med Inform. 2019. PMID: 31629312
-
Use of Non-invasive Parameters and Machine-Learning Algorithms for Predicting Future Risk of Type 2 Diabetes: A Retrospective Cohort Study of Health Data From Kuwait.Front Endocrinol (Lausanne). 2019 Sep 11;10:624. doi: 10.3389/fendo.2019.00624. eCollection 2019. Front Endocrinol (Lausanne). 2019. PMID: 31572303 Free PMC article.
Cited by
-
Bilateral Matching Method for Business Resources Based on Synergy Effects and Incomplete Data.Entropy (Basel). 2024 Aug 6;26(8):669. doi: 10.3390/e26080669. Entropy (Basel). 2024. PMID: 39202139 Free PMC article.
-
The relationship between anthropometric indices and the presence of hypertension in an Iranian population sample using data mining algorithms.J Hum Hypertens. 2024 Mar;38(3):277-285. doi: 10.1038/s41371-023-00877-z. Epub 2023 Dec 1. J Hum Hypertens. 2024. PMID: 38040904
-
Transforming Hypertension Diagnosis and Management in The Era of Artificial Intelligence: A 2023 National Heart, Lung, and Blood Institute (NHLBI) Workshop Report.Hypertension. 2025 Jan;82(1):36-45. doi: 10.1161/HYPERTENSIONAHA.124.22095. Epub 2024 Jul 16. Hypertension. 2025. PMID: 39011653 Review.
-
Next-visit prediction and prevention of hypertension using large-scale routine health checkup data.PLoS One. 2024 Nov 13;19(11):e0313658. doi: 10.1371/journal.pone.0313658. eCollection 2024. PLoS One. 2024. PMID: 39536036 Free PMC article.
-
Temporal analysis of non-communicable diseases and NCD-HIV/AIDS comorbidity in Malawi: A 4-year retrospective study 2020-2022.Trop Med Int Health. 2025 Aug;30(8):782-800. doi: 10.1111/tmi.14134. Epub 2025 Jun 3. Trop Med Int Health. 2025. PMID: 40461415 Free PMC article.
References
-
- Moreira MWL, Rodrigues JJPC, Korotaev V, Al-Muhtadi J, Kumar N. A comprehensive review on smart decision support systems for health care. Ieee Syst J. (2019) 13:3536–45. 10.1109/JSYST.2018.2890121 - DOI
-
- Simjanoska M, Kochev S, Tanevski J, Bogdanova AM, Papa G, Eftimov T. Multi-level information fusion for learning a blood pressure predictive model using sensor data. Inform Fusion. (2020) 58:24–39. 10.1016/j.inffus.2019.12.008 - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical