Can machine-learning improve cardiovascular risk prediction using routine clinical data?
- PMID: 28376093
- PMCID: PMC5380334
- DOI: 10.1371/journal.pone.0174944
Can machine-learning improve cardiovascular risk prediction using routine clinical data?
Abstract
Background: Current approaches to predict cardiovascular risk fail to identify many people who would benefit from preventive treatment, while others receive unnecessary intervention. Machine-learning offers opportunity to improve accuracy by exploiting complex interactions between risk factors. We assessed whether machine-learning can improve cardiovascular risk prediction.
Methods: Prospective cohort study using routine clinical data of 378,256 patients from UK family practices, free from cardiovascular disease at outset. Four machine-learning algorithms (random forest, logistic regression, gradient boosting machines, neural networks) were compared to an established algorithm (American College of Cardiology guidelines) to predict first cardiovascular event over 10-years. Predictive accuracy was assessed by area under the 'receiver operating curve' (AUC); and sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) to predict 7.5% cardiovascular risk (threshold for initiating statins).
Findings: 24,970 incident cardiovascular events (6.6%) occurred. Compared to the established risk prediction algorithm (AUC 0.728, 95% CI 0.723-0.735), machine-learning algorithms improved prediction: random forest +1.7% (AUC 0.745, 95% CI 0.739-0.750), logistic regression +3.2% (AUC 0.760, 95% CI 0.755-0.766), gradient boosting +3.3% (AUC 0.761, 95% CI 0.755-0.766), neural networks +3.6% (AUC 0.764, 95% CI 0.759-0.769). The highest achieving (neural networks) algorithm predicted 4,998/7,404 cases (sensitivity 67.5%, PPV 18.4%) and 53,458/75,585 non-cases (specificity 70.7%, NPV 95.7%), correctly predicting 355 (+7.6%) more patients who developed cardiovascular disease compared to the established algorithm.
Conclusions: Machine-learning significantly improves accuracy of cardiovascular risk prediction, increasing the number of patients identified who could benefit from preventive treatment, while avoiding unnecessary treatment of others.
Conflict of interest statement
Figures


Similar articles
-
Predicting post-stroke pneumonia using deep neural network approaches.Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1. Int J Med Inform. 2019. PMID: 31629312
-
Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants.PLoS One. 2019 May 15;14(5):e0213653. doi: 10.1371/journal.pone.0213653. eCollection 2019. PLoS One. 2019. PMID: 31091238 Free PMC article.
-
Dynamic prediction of psychological treatment outcomes: development and validation of a prediction model using routinely collected symptom data.Lancet Digit Health. 2021 Apr;3(4):e231-e240. doi: 10.1016/S2589-7500(21)00018-2. Lancet Digit Health. 2021. PMID: 33766287
-
A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models.J Clin Epidemiol. 2019 Jun;110:12-22. doi: 10.1016/j.jclinepi.2019.02.004. Epub 2019 Feb 11. J Clin Epidemiol. 2019. PMID: 30763612
-
Machine learning to predict adverse drug events based on electronic health records: a systematic review and meta-analysis.J Int Med Res. 2024 Dec;52(12):3000605241302304. doi: 10.1177/03000605241302304. J Int Med Res. 2024. PMID: 39668733 Free PMC article.
Cited by
-
Medicine 2032: The future of cardiovascular disease prevention with machine learning and digital health technology.Am J Prev Cardiol. 2022 Aug 29;12:100379. doi: 10.1016/j.ajpc.2022.100379. eCollection 2022 Dec. Am J Prev Cardiol. 2022. PMID: 36090536 Free PMC article. Review.
-
Combination of unsupervised discretization methods for credit risk.PLoS One. 2023 Nov 27;18(11):e0289130. doi: 10.1371/journal.pone.0289130. eCollection 2023. PLoS One. 2023. PMID: 38011207 Free PMC article.
-
Medication adherence prediction through temporal modelling in cardiovascular disease management.BMC Med Inform Decis Mak. 2022 Nov 29;22(1):313. doi: 10.1186/s12911-022-02052-9. BMC Med Inform Decis Mak. 2022. PMID: 36447245 Free PMC article.
-
Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension.Front Cardiovasc Med. 2022 Nov 14;9:1025705. doi: 10.3389/fcvm.2022.1025705. eCollection 2022. Front Cardiovasc Med. 2022. PMID: 36451926 Free PMC article.
-
Model for Predicting the Effect of Sibutramine Therapy in Obesity.J Pers Med. 2024 Jul 31;14(8):811. doi: 10.3390/jpm14080811. J Pers Med. 2024. PMID: 39202003 Free PMC article.
References
-
- World Health Organization. Global Status Report on Noncommunicable Diseases Geneva, Switzerland: World Health Organization, 2014.
-
- Goff DC, Lloyd-Jones DM, Bennett G, Coady S, D’Agostino RB, Gibbons R, et al. 2013 ACC/AHA Guideline on the Assessment of Cardiovascular Risk: A Report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation 2013; 135(11): 1–50. - PubMed
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical