Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 29;79(12):1155-1166.
doi: 10.1016/j.jacc.2022.01.021.

Coronary Risk Estimation Based on Clinical Data in Electronic Health Records

Affiliations

Coronary Risk Estimation Based on Clinical Data in Electronic Health Records

Ben O Petrazzini et al. J Am Coll Cardiol. .

Abstract

Background: Clinical features from electronic health records (EHRs) can be used to build a complementary tool to predict coronary artery disease (CAD) susceptibility.

Objectives: The purpose of this study was to determine whether an EHR score can improve CAD prediction and reclassification 1 year before diagnosis, beyond conventional clinical guidelines as determined by the pooled cohort equations (PCE) and a polygenic risk score for CAD.

Methods: We applied a machine learning framework using clinical features from the EHR in a multiethnic, clinical care cohort (BioMe) comprising 555 CAD cases and 6,349 control subjects and in a population-based cohort (UK Biobank) comprising 3,130 CAD cases and 378,344 control subjects for external validation.

Results: Compared with the PCE, the EHR score improved CAD prediction by 12% in the BioMe Biobank and by 9% in the UK Biobank. The EHR score reclassified 25.8% and 15.2% individuals in each cohort respectively, compared with the PCE score. We observed larger improvements in the EHR score over the PCE in a subgroup of individuals with low CAD risk, with 20% increased discrimination and 34.4% increased reclassification. In all models, the polygenic risk score for CAD did not improve CAD prediction, compared with the PCE or EHR score.

Conclusions: The EHR score resulted in increased prediction and reclassification for CAD, demonstrating its potential use for population health monitoring of short-term CAD risk in large health systems.

Keywords: biobank; coronary artery disease; electronic health record; machine learning; polygenic risk score; pooled cohort equations; prevention.

PubMed Disclaimer

Conflict of interest statement

Funding Support and Author Disclosures Mr Forrest is supported by the National Institute of General Medical Sciences of the National Institutes of Health (NIH) (T32-GM007280). Dr Nadkarni is supported by a career development award from the NIH (K23-DK107908) and by NIH grants (R01-DK108803, U01-HG007278, U01-HG009610, and U01-DK116100); is scientific cofounder, consultant, advisory board member, and equity owner of Renalytix AI; is a scientific cofounder and equity holder for Pensieve Health; has served as a consultant for Variant Bio; has received grants from Goldfinch Bio; and has received personal fees from Renalytix AI, BioVie, Reata, AstraZeneca, and GLG Consulting. Dr Do is supported by the National Institute of General Medical Sciences of the NIH (R35-GM124836) and the National Heart, Lung, and Blood Institute of the NIH (R01-HL139865 and R01-HL155915); has received grants from AstraZeneca; has received grants and nonfinancial support from Goldfinch Bio; is a scientific cofounder, consultant, and equity holder for Pensieve Health; and has served as a consultant for Variant Bio. All other authors have reported that they have no relationships relevant to the contents of this paper to disclose. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Figures

FIGURE 1
FIGURE 1
Study Design and Flowchart The BioMe Biobank (A) was used for training and the UK Biobank (B) was used for external validation. Statin indicates cholesterol-lowering medications. AUROC = area under the receiver-operating characteristic curve; GBT = gradient boosted trees; NRI = net reclassification improvement; PCE = pooled cohort equations; RF = random forest; SVM = support vector machines.
FIGURE 2
FIGURE 2
Receiver-Operating Characteristic Curves for 1-Year CAD Risk Prediction Models The models were tested in BioMe (A), low-risk BioMe (B), UK Biobank (C), and low-risk UK Biobank (D) individuals. The y- and x-axes correspond to averaged true positive and false negative rates, respectively, across 100 iterations. The averaged area under the curve is indicated for every model. AUC = area under the curve; EHR = electronic health record; PC = principal component; PCE = pooled cohort equations; PRS = polygenic risk score.
FIGURE 3
FIGURE 3
Change in Prediction Score Between PCE model and EHR Model The models were tested in BioMe (A) and UK Biobank (B) individuals. Dot plot shows the averaged prediction of the EHR and PCE models for every tested individual in the BioMe and UK Biobank cohorts. Red and blue indicates actual cases and control subjects, respectively. Top and right histograms show the distribution of averaged PCE and EHR prediction scores, respectively. The secondary y-axes in the histograms show the percentage of control subjects predicted in a 1% wide bin of the prediction score. Dotted vertical line indicates the 85% threshold. Abbreviations as in Figure 2.
CENTRAL ILLUSTRATION
CENTRAL ILLUSTRATION
Electronic Health Record Clinical Features Predict Short-Term Coronary Artery Disease Risk Clinical features from the electronic health record can predict 1-year coronary artery disease risk with a discrimination power of 0.94 AUROC and reclassify 25.8% of predictions coming from the Pooled Cohort Equations. AUROC = area under the receiver-operating characteristic curve; CAD = coronary artery disease; EHR = electronic health record; NRI = net reclassification improvement; PCE = Pooled Cohort Equations.

Comment in

Similar articles

Cited by

References

    1. Zamorano JL, del Val D. Predictive models of atherosclerotic cardiovascular disease: in search of the philosopher’s stone of cardiology. J Am Coll Cardiol. 2016;67:148–150. - PubMed
    1. Goff DC Jr, Lloyd-Jones DM, Bennett G, et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. J Am Coll Cardiol. 2014;63(25 Pt B):2935–2959. - PMC - PubMed
    1. Stone NJ, Robinson JG, Lichtenstein AH, et al. 2013 ACC/AHA guideline on the treatment of blood cholesterol to reduce atherosclerotic cardiovascular risk in adults: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. J Am Coll Cardiol. 2014;63(25 Pt B):2889–2934. - PubMed
    1. Ridker PM, Cook NR. Statins: new American guidelines for prevention of cardiovascular disease. Lancet. 2013;382:1762–1765. - PubMed
    1. Kavousi M, Leening MJG, Nanchen D, et al..Comparison of application of the ACC/AHA Guidelines, Adult Treatment Panel III Guidelines, and European Society of Cardiology Guidelines for Cardiovascular Disease Prevention in a European cohort. JAMA. 2014;311:1416–1423. - PubMed

Publication types