Integrating machine learning and spatial clustering for malaria case prediction in Brazil's Legal Amazon
- PMID: 40484933
- PMCID: PMC12147289
- DOI: 10.1186/s12879-025-11193-x
Integrating machine learning and spatial clustering for malaria case prediction in Brazil's Legal Amazon
Abstract
Malaria remains a major global health challenge, particularly in Brazil's Legal Amazon region, where environmental and socioeconomic conditions foster favorable conditions for disease transmission. Traditional control measures have shown limited effectiveness, emphasizing the need for better predictive approaches to support timely and targeted public health interventions. This study evaluates the performance of six computational models-Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU), Support Vector Regression (SVR), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), and Autoregressive Integrated Moving Average (ARIMA)-for forecasting weekly malaria cases across multiple states in the Legal Amazon. The results demonstrate that the RF model consistently outperformed the other models, achieving the lowest Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE) values in most cases, such as in cluster 02 of the state of Acre, with RMSE of 0.00203 and MAE of 0.00133. The integration of K-means clustering further improved the model predictive accuracy by accounting for spatial heterogeneity and capturing localized transmission dynamics. This hybrid modeling approach, combining machine learning models with spatial clustering, offers a promising tool for enhancing malaria surveillance and guiding more effective public health strategies, especially for malaria control efforts in high-risk regions.
Keywords: Machine Learning; Malaria Prediction; Public Health Surveillance; Spatial Clustering; Time Series Forecasting.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: The malaria analysis data were authorized by the FMT-HVD Comitê de Ética em Pesquisa (CEP) under CAAE number 92012818.1.0000.0005. The data used in the manuscript is anonymized and publicly available by the Brazilian Ministry of Health; therefore, according to Brazilian regulations, there is no need for an informed consent process or even previous authorization for use. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.
Figures







References
-
- Ministério da Saúde SDVES. Panorama epidemiológico da malária em 2021: buscando o caminho para a eliminação da malária no Brasil (Ministério da Saúde, Brazil, 2022). URL https://www.gov.br/saude/pt-br/centrais-de-conteudo/publicacoes/boletins....
-
- Organization WH. World malaria report 2022. 2022. URL https://www.who.int/publications/i/item/9789240064898. Accessed: 2024–02–04.
-
- World Health Organization. World malaria report 2024. 2024. https://www.who.int/publications/malaria-report-2024. Accessed May 2025.
-
- Pattanayak SK, Pakhtigian EL, Litzow EL. Through the looking glass: environmental health economics in low and middle income countries. 2018.
-
- Ashley EA, Phyo AP, Woodrow CJ. Malaria. Lancet. 2018;391:1608–21. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical