Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 16;19(5):e0299386.
doi: 10.1371/journal.pone.0299386. eCollection 2024.

Predicting malaria outbreak in The Gambia using machine learning techniques

Affiliations

Predicting malaria outbreak in The Gambia using machine learning techniques

Ousman Khan et al. PLoS One. .

Abstract

Malaria is the most common cause of death among the parasitic diseases. Malaria continues to pose a growing threat to the public health and economic growth of nations in the tropical and subtropical parts of the world. This study aims to address this challenge by developing a predictive model for malaria outbreaks in each district of The Gambia, leveraging historical meteorological data. To achieve this objective, we employ and compare the performance of eight machine learning algorithms, including C5.0 decision trees, artificial neural networks, k-nearest neighbors, support vector machines with linear and radial kernels, logistic regression, extreme gradient boosting, and random forests. The models are evaluated using 10-fold cross-validation during the training phase, repeated five times to ensure robust validation. Our findings reveal that extreme gradient boosting and decision trees exhibit the highest prediction accuracy on the testing set, achieving 93.3% accuracy, followed closely by random forests with 91.5% accuracy. In contrast, the support vector machine with a linear kernel performs less favorably, showing a prediction accuracy of 84.8% and underperforming in specificity analysis. Notably, the integration of both climatic and non-climatic features proves to be a crucial factor in accurately predicting malaria outbreaks in The Gambia.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The map of the Gambia showing its five main meteorological stations across different regions.
Fig 2
Fig 2. Distribution of climatic variables, malaria cases and deaths: Malaria cases, deaths and rainfall are highly skewed to the right.
None of the features appeared to be distributed normally.
Fig 3
Fig 3. Proportion of outbreak in both the training and testing data.
Fig 4
Fig 4. The Pearson’s correlation among all the climatic variables used.
Fig 5
Fig 5. Variation of the climatic variables, total malaria cases and deaths in each month over the years.
In the upper panels of the first row, we can observe the number of malaria cases and deaths in thousands, although not used as predictors. Each line represents a different year, indicated by the various colors. The bottom four panels show the climate variables and their variability. The boxplots depict each month of the year, while the colored dots represent the data points in the corresponding years. The minimum and maximum temperatures were measured in degrees Celsius (°C), the rainfall was measured in millimeters (mm), and the relative humidity was measured in percentages (%).
Fig 6
Fig 6. This plot shows the area under the Receiver Operating Characteristic (ROC), (AUC) of all the learning algorithms.
The green and the blue curve shows the ROC curve using the training and testing data respectively. XGBoost has the highest AUC value of 0.97. (a) C5.0 DT. (b) KNN. (c) Logistic Regression. (d) ANN. (e) Random Forest. (f) XGBoost. (g) svmLinear. (h) svmRadial.

Similar articles

Cited by

References

    1. Nyasa RB, Fotabe EL, Ndip RN. Trends in malaria prevalence and risk factors associated with the disease in Nkongho-mbeng; a typical rural setting in the equatorial rainforest of the South West Region of Cameroon. Plos one. 2021;16(5):e0251380. doi: 10.1371/journal.pone.0251380 - DOI - PMC - PubMed
    1. Monroe A, Williams NA, Ogoma S, Karema C, Okumu F. Reflections on the 2021 World Malaria Report and the future of malaria control; 2022. - PMC - PubMed
    1. Moyes CL, Athinya DK, Seethaler T, Battle KE, Sinka M, Hadi MP, et al.. Evaluating insecticide resistance across African districts to aid malaria control decisions. Proceedings of the National Academy of Sciences. 2020;117(36):22042–22050. doi: 10.1073/pnas.2006781117 - DOI - PMC - PubMed
    1. Liu Q, Jing W, Kang L, Liu J, Liu M. Trends of the global, regional and national incidence of malaria in 204 countries from 1990 to 2019 and implications for malaria prevention. Journal of Travel Medicine. 2021;28(5):taab046. doi: 10.1093/jtm/taab046 - DOI - PMC - PubMed
    1. Mohammadkhani M, Khanjani N, Bakhtiari B, Tabatabai SM, Sheikhzadeh K. The relation between climatic factors and malaria incidence in Sistan and Baluchestan, Iran. Sage Open. 2019;9(3):2158244019864205. doi: 10.1177/2158244019864205 - DOI