Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Dec 22;8(1):789.
doi: 10.1038/s41746-025-02235-4.

Development and evaluation of a machine learning model predicting out-of-hospital cardiac arrest using environmental factors

Affiliations

Development and evaluation of a machine learning model predicting out-of-hospital cardiac arrest using environmental factors

Takahiro Nakashima et al. NPJ Digit Med. .

Abstract

Development of an accurate system to predict the daily incidence of out-of-hospital cardiac arrest (OHCA) might provide a significant public health benefit. We developed a machine learning model to predict daily OHCA incidence at the regional level using high-resolution meteorological data, chronological data, and sociodemographic data. Among OHCAs of non-traumatic cause, 196,735 in the training dataset (2013-2017), 119,455 in the testing dataset (2018-2019) from internal areas, and 37,160 in the testing dataset from external areas were included in the analysis. Application of invariant causal prediction (ICP) successfully reduced the number of variables to 17 while maintaining high predictive performance for nationwide incidence per 100,000 per day in both the training and testing datasets, whether from internal or external regions, comparable to the non-ICP model using the initial 34 variables. Furthermore, the prediction model retained a satisfactory performance up to seven days in advance.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests. Patient and public involvement: No patients or members of the public were directly involved in the design, or conduct, or reporting, or dissemination plans of this research.

Figures

Fig. 1
Fig. 1. Observed versus predicted incidence of out-of-hospital cardiac arrest by various analysis methods.
The results obtained using each method, including GAM (A), XGBoost (B), CatBoost (C), and Random Forest (D), are presented. The light blue lines indicate the observed daily incidence per 100,000 of out-of-hospital cardiac arrests in the registry participating areas. The yellow lines indicate the predicted daily incidence per 100,000 based on combined meteorological, chronological, and sociodemographic variables. GAM generalized additive model, Jan means January, XGBoost eXtreme gradient boosting.
Fig. 2
Fig. 2. Observed versus predicted incidence of out-of-hospital cardiac arrest in the invariant causal prediction model at varying time intervals.
A shows the results of the training dataset obtained using the ICP model. B and C present the testing results for same-day, 3-day-ahead, and 7-day-ahead predictions in internal and external settings, respectively. The light blue lines indicate the observed daily incidence per 100,000 of out-of-hospital cardiac arrests in the registry-participating areas. The yellow lines indicate the predicted daily incidence per 100,000 by the XGBoost gradient boosting model using predictors selected by ICP. ICP denotes invariant causal prediction; Jan, January.
Fig. 3
Fig. 3. Importance of variables in a machine learning prediction model.
This figure shows a variable importance plot for meteorological variables (red), chronological variables (blue), and sociodemographic variables (black) in a machine learning prediction model using XGBoost. The yellow to purple dots in each row represent low to high values for each predictor normally scaled. The x-axis shows the Shapley value, indicating the variable’s impact on the model. Positive SHAP values tend to drive predictions toward more cases of OHCA and negative SHAP values tend to drive the prediction toward fewer cases of OHCA. * In the model, 2013 was considered year 0. OHCA denotes out-of-hospital cardiac arrest; SHAP, Shapley Additive Explanations; XGBoost, eXtreme Gradient Boosting.
Fig. 4
Fig. 4. Data sources and overview of the machine learning model for predicting out-of-hospital cardiac arrest incidence.
A illustrates the states and communities participating in the Cardiac Arrest Registry to Enhance Survival (CARES). B displays an example of daily maximum ambient temperature data on July 15, 2022, obtained from the North American Land Data Assimilation System (NLDAS). C summarizes the development process of the machine learning model used to predict out-of-hospital cardiac arrest incidence.

References

    1. Berdowski, J., Berg, R. A., Tijssen, J. G. & Koster, R. W. Global incidences of out-of-hospital cardiac arrest and survival rates: systematic review of 67 prospective studies. Resuscitation81, 1479–1487 (2010). - DOI - PubMed
    1. Yan, S. et al. The global survival rate among adult out-of-hospital cardiac arrest patients who received cardiopulmonary resuscitation: a systematic review and meta-analysis. Crit. Care24, 61 (2020). - DOI - PMC - PubMed
    1. Yamaji, K. et al. Relation of ST-segment elevation myocardial infarction to daily ambient temperature and air pollutant levels in a Japanese nationwide percutaneous coronary intervention registry. Am. J. Cardiol.119, 872–880 (2017). - DOI - PubMed
    1. Cold exposure and winter mortality from ischaemic heart disease, cerebrovascular disease, respiratory disease, and all causes in warm and cold regions of Europe. The Eurowinter Group. Lancet349, 1341–1346 (1997). - PubMed
    1. Wolf, K. et al. Air temperature and the occurrence of myocardial infarction in Augsburg, Germany. Circulation120, 735–742 (2009). - DOI - PubMed

LinkOut - more resources