Sex-specific cardiovascular risk factors in the UK Biobank
- PMID: 39165282
- PMCID: PMC11333928
- DOI: 10.3389/fphys.2024.1339866
Sex-specific cardiovascular risk factors in the UK Biobank
Abstract
The lack of sex-specific cardiovascular disease criteria contributes to the underdiagnosis of women compared to that of men. For more than half a century, the Framingham Risk Score has been the gold standard to estimate an individual's risk of developing cardiovascular disease based on the age, sex, cholesterol levels, blood pressure, diabetes status, and the smoking status. Now, machine learning can offer a much more nuanced insight into predicting the risk of cardiovascular diseases. The UK Biobank is a large database that includes traditional risk factors and tests related to the cardiovascular system: magnetic resonance imaging, pulse wave analysis, electrocardiograms, and carotid ultrasounds. Here, we leverage 20,542 datasets from the UK Biobank to build more accurate cardiovascular risk models than the Framingham Risk Score and quantify the underdiagnosis of women compared to that of men. Strikingly, for a first-degree atrioventricular block and dilated cardiomyopathy, two conditions with non-sex-specific diagnostic criteria, our study shows that women are under-diagnosed 2× and 1.4× more than men. Similarly, our results demonstrate the need for sex-specific criteria in essential primary hypertension and hypertrophic cardiomyopathy. Our feature importance analysis reveals that out of the top 10 features across three sexes and four disease categories, traditional Framingham factors made up between 40% and 50%; electrocardiogram, 30%-33%; pulse wave analysis, 13%-23%; and magnetic resonance imaging and carotid ultrasound, 0%-10%. Improving the Framingham Risk Score by leveraging big data and machine learning allows us to incorporate a wider range of biomedical data and prediction features, enhance personalization and accuracy, and continuously integrate new data and knowledge, with the ultimate goal to improve accurate prediction, early detection, and early intervention in cardiovascular disease management. Our analysis pipeline and trained classifiers are freely available at https://github.com/LivingMatterLab/CardiovascularDiseaseClassification.
Keywords: UK Biobank; cardiovascular; heart disease; risk factors; sex differences.
Copyright © 2024 St. Pierre, Kaczmarski, Peirlinck and Kuhl.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Figures





Similar articles
-
Derivation and validation of QRISK, a new cardiovascular disease risk score for the United Kingdom: prospective open cohort study.BMJ. 2007 Jul 21;335(7611):136. doi: 10.1136/bmj.39261.471806.55. Epub 2007 Jul 5. BMJ. 2007. PMID: 17615182 Free PMC article.
-
Prediction of cardiovascular risk factors from retinal fundus photographs: Validation of a deep learning algorithm in a prospective non-interventional study in Kenya.Diabetes Obes Metab. 2024 Jul;26(7):2722-2731. doi: 10.1111/dom.15587. Epub 2024 Apr 15. Diabetes Obes Metab. 2024. PMID: 38618987
-
Should pulse pressure become part of the Framingham risk score?J Hum Hypertens. 2004 Apr;18(4):279-86. doi: 10.1038/sj.jhh.1001669. J Hum Hypertens. 2004. PMID: 15037878
-
Predicting cardiovascular risk in England and Wales: prospective derivation and validation of QRISK2.BMJ. 2008 Jun 28;336(7659):1475-82. doi: 10.1136/bmj.39609.449676.25. Epub 2008 Jun 23. BMJ. 2008. PMID: 18573856 Free PMC article.
-
[Cardiovascular risk and cardiometabolic risk: an epidemiological evaluation].G Ital Cardiol (Rome). 2008 Apr;9(4 Suppl 1):6S-17S. G Ital Cardiol (Rome). 2008. PMID: 18773746 Review. Italian.
References
-
- Abadi M., Agarwal A., Barham P., Brevdo E., Chen Z., Citro C., et al. (2016). TensorFlow: large-scale machine learning on heterogeneous distributed systems. arXiv:10.48550/arXiv.1603.04467 - DOI
-
- Aceña V., Martín de Diego I., Fernández R. R., Moguerza M. J. (2022). Minimally overfitted learners: a general framework for ensemble learning. Knowledge-Based Syst. 254, 109669. 10.1016/j.knosys.2022.109669 - DOI
-
- Alber M., Buganza Tepole A., Cannon W., De S., Dura-Bernal S., Garikipati K., et al. (2019). Integrating machine learning and multiscale modeling: perspectives, challenges, and opportunities in the biological, biomedical, and behavioral sciences. npj Digit. Med. 2, 115. 10.1038/s41746-019-0193-y - DOI - PMC - PubMed
-
- Arik S. O., Pfister T. (2020). Tabnet: attentive interpretable tabular learning. arXiv :10.48550/arXiv.1908.07442 - DOI
LinkOut - more resources
Full Text Sources