Predicting the incidence of infectious diarrhea with symptom surveillance data using a stacking-based ensembled model
- PMID: 38408967
- PMCID: PMC10898154
- DOI: 10.1186/s12879-024-09138-x
Predicting the incidence of infectious diarrhea with symptom surveillance data using a stacking-based ensembled model
Abstract
Background: Infectious diarrhea remains a major public health problem worldwide. This study used stacking ensemble to developed a predictive model for the incidence of infectious diarrhea, aiming to achieve better prediction performance.
Methods: Based on the surveillance data of infectious diarrhea cases, relevant symptoms and meteorological factors of Guangzhou from 2016 to 2021, we developed four base prediction models using artificial neural networks (ANN), Long Short-Term Memory networks (LSTM), support vector regression (SVR) and extreme gradient boosting regression trees (XGBoost), which were then ensembled using stacking to obtain the final prediction model. All the models were evaluated with three metrics: mean absolute percentage error (MAPE), root mean square error (RMSE), and mean absolute error (MAE).
Results: Base models that incorporated symptom surveillance data and weekly number of infectious diarrhea cases were able to achieve lower RMSEs, MAEs, and MAPEs than models that added meteorological data and weekly number of infectious diarrhea cases. The LSTM had the best prediction performance among the four base models, and its RMSE, MAE, and MAPE were: 84.85, 57.50 and 15.92%, respectively. The stacking ensembled model outperformed the four base models, whose RMSE, MAE, and MAPE were 75.82, 55.93, and 15.70%, respectively.
Conclusions: The incorporation of symptom surveillance data could improve the predictive accuracy of infectious diarrhea prediction models, and symptom surveillance data was more effective than meteorological data in enhancing model performance. Using stacking to combine multiple prediction models were able to alleviate the difficulty in selecting the optimal model, and could obtain a model with better performance than base models.
Keywords: Ensemble learning; Infectious diarrhea; Prediction model; Stacking; Symptom surveillance.
© 2024. The Author(s).
Conflict of interest statement
The authors have no relevant financial or non-financial interests to disclose.
Figures





Similar articles
-
Deep learning models for hepatitis E incidence prediction leveraging meteorological factors.PLoS One. 2023 Mar 13;18(3):e0282928. doi: 10.1371/journal.pone.0282928. eCollection 2023. PLoS One. 2023. PMID: 36913401 Free PMC article.
-
Explainable prediction of daily hospitalizations for cerebrovascular disease using stacked ensemble learning.BMC Med Inform Decis Mak. 2023 Apr 6;23(1):59. doi: 10.1186/s12911-023-02159-7. BMC Med Inform Decis Mak. 2023. PMID: 37024922 Free PMC article.
-
Prediction of hepatitis E using machine learning models.PLoS One. 2020 Sep 17;15(9):e0237750. doi: 10.1371/journal.pone.0237750. eCollection 2020. PLoS One. 2020. PMID: 32941452 Free PMC article.
-
Improving the precision of modeling the incidence of hemorrhagic fever with renal syndrome in mainland China with an ensemble machine learning approach.PLoS One. 2021 Mar 16;16(3):e0248597. doi: 10.1371/journal.pone.0248597. eCollection 2021. PLoS One. 2021. PMID: 33725011 Free PMC article.
-
Study of the influence of meteorological factors on HFMD and prediction based on the LSTM algorithm in Fuzhou, China.BMC Infect Dis. 2023 May 5;23(1):299. doi: 10.1186/s12879-023-08184-1. BMC Infect Dis. 2023. PMID: 37147566 Free PMC article.
Cited by
-
From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases.BioData Min. 2024 Oct 22;17(1):42. doi: 10.1186/s13040-024-00396-8. BioData Min. 2024. PMID: 39438943 Free PMC article.
References
-
- Abbafati C, Abbas KM, Abbasi M, Abbasifard M, Abbasi-Kangevari M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the global burden of Disease Study 2019. Lancet. 2020;396(10258):1204–22. doi: 10.1016/S0140-6736(20)30925-9. - DOI - PMC - PubMed
-
- Wang Yongming J, Li J, Gu Z, Zhou, Wang Z. Artificial neural networks for infectious diarrhea prediction using meteorological factors in Shanghai (China) Appl Soft Comput. 2015;35:280–90. doi: 10.1016/j.asoc.2015.05.047. - DOI
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources