Spatiotemporal models of dengue epidemiology in the Philippines: Integrating remote sensing and interpretable machine learning
- PMID: 38701871
- DOI: 10.1016/j.actatropica.2024.107225
Spatiotemporal models of dengue epidemiology in the Philippines: Integrating remote sensing and interpretable machine learning
Abstract
Previous dengue epidemiological analyses have been limited in spatiotemporal extent or covariate dimensions, the latter neglecting the multifactorial nature of dengue. These constraints, caused by rigid and traditional statistical tools which collapse amidst 'Big Data', prompt interpretable machine-learning (iML) approaches. Predicting dengue incidence and mortality in the Philippines, a data-limited yet high-burden country, the mlr3 universe of R packages was used to build and optimize ML models based on remotely sensed provincial and dekadal 3 NDVI and 9 rainfall features from 2016 to 2020. Between two tasks, models differ across four random forest-based learners and two clustering strategies. Among 16 candidates, rfsrc-year-case and ranger-year-death significantly perform best for predicting dengue incidence and mortality, respectively. Therefore, temporal clustering yields the best models, reflective of dengue seasonality. The two best models were subjected to tripartite global exploratory model analyses, which encompass model-agnostic post-hoc methods such as Permutation Feature Importance (PFI) and Accumulated Local Effects (ALE). PFI reveals that the models differ in their important explanatory aspect, rainfall for rfsrc-year-case and NDVI for ranger-year-death, among which long-term average (lta) features are most relevant. Trend-wise, ALE reveals that average incidence predictions are positively associated with 'Rain.lta', reflective of dengue cases peaking during the wet season. In contrast, those for mortality are negatively associated with 'NDVI.lta', reflective of urban spaces driving dengue-related deaths. By technologically addressing the challenges of the human-animal-ecosystem interface, this study adheres to the One Digital Health paradigm operationalized under Sustainable Development Goals (SDGs). Leveraging data digitization and predictive modeling for epidemiological research paves SDG 3, which prioritizes holistic health and well-being.
Keywords: Dengue virus (DENV); Explainable artificial intelligence; One health; Spatiotemporal epidemiology; Vector-borne disease.
Copyright © 2024 Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
