Real-Time Classification of Causes of Death Using AI: Sensitivity Analysis
- PMID: 38875558
- PMCID: PMC11041420
- DOI: 10.2196/40965
Abstract
Background: In 2021, the European Union reported >270,000 excess deaths, including >16,000 in Portugal. The Portuguese Directorate-General of Health developed a deep neural network, AUTOCOD, which determines the primary causes of death by analyzing the free text of physicians' death certificates (DCs). Although AUTOCOD's performance has been established, it is unclear whether that performance stays consistent over time, particularly during periods of excess mortality.
Objective: This study aims to assess the sensitivity and other performance metrics of AUTOCOD, compared with manual coding, in classifying underlying causes of death, so that specific causes of death can be identified during periods of excess mortality.
Methods: We included all DCs between 2016 and 2019. AUTOCOD's performance was evaluated by calculating various performance metrics, such as sensitivity, specificity, positive predictive value (PPV), and F1-score, using a confusion matrix. The confusion matrix compared the International Statistical Classification of Diseases and Related Health Problems, 10th Revision (ICD-10), classifications of DCs by AUTOCOD with those by human coders at the Directorate-General of Health (gold standard). Subsequently, we compared periods without excess mortality with periods of excess, severe excess, and extreme excess mortality. We defined excess mortality as 2 consecutive days with a Z score above the 95% baseline limit, severe excess mortality as 2 consecutive days with a Z score >4 SDs, and extreme excess mortality as 2 consecutive days with a Z score >6 SDs. Finally, we repeated the analyses for the 3 most common ICD-10 chapters, focusing on block-level classification.
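As an illustration of the metrics named in the Methods, the following minimal Python sketch derives sensitivity, specificity, PPV, and F1-score for one ICD-10 chapter by treating it as the positive class against all other chapters. The function name, the one-vs-rest treatment, and the toy labels are assumptions made for illustration; they are not taken from the study's code or data.

```python
import math


def one_vs_rest_metrics(gold, predicted, label):
    """Metrics for a single class, treating every other class as negative.

    gold      -- labels assigned by human coders (reference standard)
    predicted -- labels assigned by the automatic classifier
    label     -- the class (e.g., an ICD-10 chapter) to evaluate
    """
    tp = fp = fn = tn = 0
    for g, p in zip(gold, predicted):
        if g == label and p == label:
            tp += 1  # both agree on the class
        elif g != label and p == label:
            fp += 1  # classifier assigned the class, coder did not
        elif g == label and p != label:
            fn += 1  # coder assigned the class, classifier missed it
        else:
            tn += 1  # neither assigned the class

    def ratio(num, den):
        # Return NaN instead of raising when a denominator is empty.
        return num / den if den else math.nan

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "f1": ratio(2 * tp, 2 * tp + fp + fn),
    }


# Toy example with made-up chapter labels (not study data).
gold = ["II", "II", "IX", "X", "IX", "II"]
predicted = ["II", "IX", "IX", "X", "IX", "II"]

chapter_ii = one_vs_rest_metrics(gold, predicted, "II")
chapter_ix = one_vs_rest_metrics(gold, predicted, "IX")
print(chapter_ii)
print(chapter_ix)
```

In this toy data, one chapter II certificate is assigned to chapter IX by the classifier, which lowers sensitivity for chapter II and lowers specificity and PPV for chapter IX.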
Results: We analyzed a data set comprising 330,098 DCs classified by both human coders and AUTOCOD. AUTOCOD demonstrated high sensitivity (≥0.75) for 10 ICD-10 chapters examined, with values surpassing 0.90 for the 3 most prevalent chapters (chapter II, "Neoplasms"; chapter IX, "Diseases of the circulatory system"; and chapter X, "Diseases of the respiratory system"), which together accounted for 67.69% (223,459/330,098) of all human-coded causes of death. No substantial differences were observed in these high-sensitivity values when comparing periods without excess mortality with periods of excess, severe excess, and extreme excess mortality. The same holds for specificity, which exceeded 0.96 for all chapters examined, and for PPV, which surpassed 0.75 in 9 chapters, including the more prevalent ones. At the block level within the 3 most common ICD-10 chapters, AUTOCOD maintained high performance, with high sensitivity (≥0.75) for 13 ICD-10 blocks, high PPV for 9 blocks, and specificity of >0.98 in all blocks, with no significant differences between periods without excess mortality and those with excess mortality.
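The period comparison relies on the Methods' definition of excess mortality as 2 consecutive days with a Z score above a threshold. The sketch below shows one way such days could be flagged in a daily Z-score series. The function name, the choice to flag every day in a qualifying run, and the sample values are assumptions; the thresholds of 4 and 6 SDs come from the abstract, while the value of the 95% baseline limit is not given there and is therefore left as a parameter.

```python
def flag_excess_days(z_scores, threshold, min_run=2):
    """Flag days belonging to a run of at least `min_run` consecutive
    days whose Z score is strictly above `threshold`.

    Assumption: every day in a qualifying run is flagged, and isolated
    days above the threshold are not.
    """
    flags = [False] * len(z_scores)
    run_start = None
    # A trailing sentinel below any threshold closes a run that
    # extends to the end of the series.
    for i, z in enumerate(list(z_scores) + [float("-inf")]):
        if z > threshold:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_run:
                for j in range(run_start, i):
                    flags[j] = True
            run_start = None
    return flags


# Made-up daily Z scores (not study data).
daily_z = [0.5, 4.2, 4.8, 1.0, 6.5, 3.0, 6.1, 6.3, 7.0]

severe = flag_excess_days(daily_z, threshold=4.0)   # Z score > 4 SDs
extreme = flag_excess_days(daily_z, threshold=6.0)  # Z score > 6 SDs
print(severe)
print(extreme)
```

Certificates dated on flagged days could then be evaluated separately from those on unflagged days, so that performance metrics are compared across period types.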
Conclusions: Our findings indicate that, during periods of excess and extreme excess mortality, AUTOCOD's performance remains unaffected by any degradation in text quality caused by pressure on health services. Consequently, AUTOCOD can be dependably used for real-time cause-specific mortality surveillance even in extreme excess mortality situations.
Keywords: AI; artificial intelligence; deep learning; deep neural networks; evaluation; machine learning; mortality; mortality statistics; underlying cause of death.
©Patrícia Pita Ferreira, Diogo Godinho Simões, Constança Pinto de Carvalho, Francisco Duarte, Eugénia Fernandes, Pedro Casaca Carvalho, José Francisco Loff, Ana Paula Soares, Maria João Albuquerque, Pedro Pinto-Leite, André Peralta-Santos. Originally published in JMIR AI (https://ai.jmir.org), 22.11.2023.
Conflict of interest statement
Conflicts of Interest: None declared.