Machine learning models for predicting blood pressure phenotypes by combining multiple polygenic risk scores
- PMID: 38816422
- PMCID: PMC11139858
- DOI: 10.1038/s41598-024-62945-9
Machine learning models for predicting blood pressure phenotypes by combining multiple polygenic risk scores
Abstract
We construct non-linear machine learning (ML) prediction models for systolic and diastolic blood pressure (SBP, DBP) using demographic and clinical variables and polygenic risk scores (PRSs). We developed a two-model ensemble, consisting of a baseline model, where prediction is based on demographic and clinical variables only, and a genetic model, where we also include PRSs. We evaluate the use of a linear versus a non-linear model at both the baseline and the genetic model levels and assess the improvement in performance when incorporating multiple PRSs. We report the ensemble model's performance as percentage variance explained (PVE) on a held-out test dataset. A non-linear baseline model improved the PVEs from 28.1 to 30.1% (SBP) and 14.3% to 17.4% (DBP) compared with a linear baseline model. Including seven PRSs in the genetic model computed based on the largest available GWAS of SBP/DBP improved the genetic model PVE from 4.8 to 5.1% (SBP) and 4.7 to 5% (DBP) compared to using a single PRS. Adding additional 14 PRSs computed based on two independent GWASs further increased the genetic model PVE to 6.3% (SBP) and 5.7% (DBP). PVE differed across self-reported race/ethnicity groups, with primarily all non-White groups benefitting from the inclusion of additional PRSs. In summary, non-linear ML models improves BP prediction in models incorporating diverse populations.
© 2024. The Author(s).
Conflict of interest statement
B Psaty serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. G Lyons is currently a full time employee of Alexion, AstraZeneca Rare Disease, and hold stock in the company, however, her contributions to the present manuscript were performed as part of her previous affiliation at the Harvard T.H. Chan School of Public Health and this work is not related to her current occupation and affiliation. M Moll has received grant funding from Bayer and consulting fees from TriNetX, 2ndMD, TheaHealth, Sitka, Verona Pharma, and Axon Advisors. All other authors report no competing interests.
Figures





Update of
-
Machine learning models for blood pressure phenotypes combining multiple polygenic risk scores.medRxiv [Preprint]. 2023 Dec 14:2023.12.13.23299909. doi: 10.1101/2023.12.13.23299909. medRxiv. 2023. Update in: Sci Rep. 2024 May 30;14(1):12436. doi: 10.1038/s41598-024-62945-9. PMID: 38168328 Free PMC article. Updated. Preprint.
Similar articles
-
Machine learning models for blood pressure phenotypes combining multiple polygenic risk scores.medRxiv [Preprint]. 2023 Dec 14:2023.12.13.23299909. doi: 10.1101/2023.12.13.23299909. medRxiv. 2023. Update in: Sci Rep. 2024 May 30;14(1):12436. doi: 10.1038/s41598-024-62945-9. PMID: 38168328 Free PMC article. Updated. Preprint.
-
Evaluating the use of blood pressure polygenic risk scores across race/ethnic background groups.Nat Commun. 2023 Jun 2;14(1):3202. doi: 10.1038/s41467-023-38990-9. Nat Commun. 2023. PMID: 37268629 Free PMC article.
-
Preeclampsia prediction with maternal and paternal polygenic risk scores: the TMM BirThree Cohort Study.Sci Rep. 2025 Apr 21;15(1):13743. doi: 10.1038/s41598-025-97291-x. Sci Rep. 2025. PMID: 40258933 Free PMC article.
-
Polygenic risk scores in kidney transplantation.Curr Opin Organ Transplant. 2025 Jun 1;30(3):208-214. doi: 10.1097/MOT.0000000000001212. Epub 2025 Apr 1. Curr Opin Organ Transplant. 2025. PMID: 40171629 Review.
-
Implementation and implications for polygenic risk scores in healthcare.Hum Genomics. 2021 Jul 20;15(1):46. doi: 10.1186/s40246-021-00339-y. Hum Genomics. 2021. PMID: 34284826 Free PMC article. Review.
Cited by
-
Computational Biology in the Discovery of Biomarkers in the Diagnosis, Treatment and Management of Cardiovascular Diseases.Cardiol Cardiovasc Med. 2024;8(5):405-414. Epub 2024 Sep 5. Cardiol Cardiovasc Med. 2024. PMID: 39328401 Free PMC article.
-
Advancements in genetic research by the Hispanic Community Health Study/Study of Latinos: A 10-year retrospective review.HGG Adv. 2025 Jan 9;6(1):100376. doi: 10.1016/j.xhgg.2024.100376. Epub 2024 Oct 29. HGG Adv. 2025. PMID: 39473183 Free PMC article. Review.
References
-
- Tibshirani R. Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Ser. B (Methodol.) 1996;58(1):267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x. - DOI