Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Dec 10;15(12):e0243558.
doi: 10.1371/journal.pone.0243558. eCollection 2020.

Mapping risk of ischemic heart disease using machine learning in a Brazilian state

Affiliations

Mapping risk of ischemic heart disease using machine learning in a Brazilian state

Marcela Bergamini et al. PLoS One. .

Abstract

Cardiovascular diseases are the leading cause of deaths globally. Machine learning studies predicting mortality rates for ischemic heart disease (IHD) at the municipal level are very limited. The goal of this paper was to create and validate a Heart Health Care Index (HHCI) to predict risk of IHD based on location and risk factors. Secondary data, geographical information system (GIS) and machine learning were used to validate the HHCI and stratify the IHD municipality risk in the state of Paraná. A positive spatial autocorrelation was found (Moran's I = 0.6472, p-value = 0.001), showing clusters of high IHD mortality. The Support Vector Machine, which had an RMSE of 0.789 and error proportion close to one (0.867), was the best for prediction among eight machine learning algorithms after validation. In the north and northwest regions of the state, HHCI was low and mortality clusters patterns were high. By creating an HHCI through ML, we can predict IHD mortality rate at municipal level, identifying predictive characteristics that impact health conditions of these localities' guided health management decisions for improvements for IHD within the emergency care network in the state of Paraná.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Representative flowchart of the main machine learning model development stages.
1) Variables pre-processing (correlation and variation tests); 2) Predictive variables cross-validation; 3) Machine learning models discrimination and validation (internal and external validation); 4) Heart Health Care Index (HHCI) calculation and mapping.
Fig 2
Fig 2. Spatial exploratory analysis of municipalities’ IHD mortality rates.
A—Distribution of spatially IHD mortality rates observed in the state of Paraná from 2009–2015 in quantiles by municipality. B—Local Indicators of Spatial Association (LISA) of IHD mortality rates by municipality.
Fig 3
Fig 3. Calibration graphs of the tested predictive models (adjustments using RMSE).
A- Example of underfitting calibration model with the worst adjustment (K-Nearest Neighbors); B- Example of overfitting calibration model (Random Forest); C: Example of best fit model (Support Vector Machine). Blue represents the observed mortality rate and red represents the predicted one.
Fig 4
Fig 4. RMSE distribution representation.
A—PCR model (2015) B—SVM model (2015).
Fig 5
Fig 5. Distribution of the generated municipalities index.
1 lower risk and 0 higher risk.

References

    1. WHO, International Statistical Classification of Diseases and Related Health Problems 10th Revision (ICD-10); Version 2016. Disponível online: https://icd.who.int/browse10/2016/en#/I20-I25 (Acesso: 30 de janeiro).
    1. DATASUS–Departamento de Informática do SUS. Informações de Saúde, Epidemiológicas e Morbidade: banco de dados. Disponível em: http://www2.datasus.gov.br/DATASUS/index.php?area=0203 (Acesso: 14 jul. 2018).
    1. Hertz JT, Fu T, Vissoci JR, Augusto T, Rocha H, Carvalho E, et al. The distribution of cardiac diagnostic testing for acute coronary syndrome in the Brazilian healthcare system : A national geospatial evaluation of health access. PLoS One. 2019;14(1):1–16. 10.1371/journal.pone.0210502 - DOI - PMC - PubMed
    1. Goldstein BA, Navar AM, Carter RE. Moving beyond regression techniques in cardiovascular risk prediction: Applying machine learning to address analytic challenges. Eur Heart J. 2017;38(23):1805–14. 10.1093/eurheartj/ehw302 - DOI - PMC - PubMed
    1. Seligman B, Tuljapurkar S, Rehkopf D. Machine learning approaches to the social determinants of health in the health and retirement study. SSM—Popul Heal [Internet]. 2018;4 (November 2017):95–9. Available from: 10.1016/j.ssmph.2017.11.008 - DOI - PMC - PubMed

Publication types