Sci Rep. 2024 Nov 14;14(1):27968.
doi: 10.1038/s41598-024-76847-3.

Prediction of acute respiratory infections using machine learning techniques in Amhara Region, Ethiopia

Abdulaziz Kebede Kassaw et al. Sci Rep. 2024.

Abstract

Many studies have shown that infectious diseases are responsible for the majority of deaths in children under five. Among these children, acute respiratory infections (ARI) are the most prevalent illness and cause of death worldwide, and they remain the leading cause of death in developing countries, including Ethiopia. A machine learning approach was employed to predict the main factors contributing to acute respiratory infections in the Amhara regional state of Ethiopia. This study used data from the 2016 Ethiopian Demographic and Health Survey. Seven machine learning models (logistic regression, random forest, decision tree, gradient boosting, support vector machine, Naïve Bayes, and K-nearest neighbors) were employed to predict the factors influencing acute respiratory infections. The performance of each model was assessed using receiver operating characteristic curves and various metrics. Among the seven models, the Random Forest algorithm demonstrated the highest accuracy in predicting acute respiratory infections, with an accuracy of 90.35% and an area under the curve of 94.80%. It was followed by the Decision Tree model with an accuracy of 88.69%, K-nearest neighbors with 86.35%, and Gradient Boosting with 82.69%. The Random Forest algorithm also had positive and negative predictive values of 92.22% and 88.83%, respectively. Several factors were identified as significantly associated with ARI among children under five in the Amhara regional state, Ethiopia.
These factors included families with a poorer wealth status (log odds of 0.18) compared to their counterparts; families with four to six children (log odds of 0.1) compared to families with fewer than three living children; children without a history of diarrhea (log odds of -0.08); mothers who had an occupation (log odds of 0.06) compared to mothers who did not; children under six months of age (log odds of -0.05) compared to children older than six months; mothers with no education (log odds of 0.04) compared to mothers with primary education or higher; rural residents (log odds of 0.03) compared to non-rural residents; and families using wood as a cooking fuel (log odds of 0.03) compared to those using electricity. Shapley Additive exPlanations (SHAP) value analysis of the Random Forest algorithm identified significant risk factors for acute respiratory infections among children in the Amhara regional state of Ethiopia. The study found that the family's wealth index, the number of children in the household, the mother's occupation, the mother's educational level, the type of residence, and the type of fuel used for cooking were all associated with acute respiratory infections. Additionally, the research emphasized the importance of children being free from diarrhea and living in households with fewer children as essential factors for improving children's health outcomes in the Amhara regional state, Ethiopia.
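The abstract reports accuracy, area under the ROC curve, and positive and negative predictive values for each classifier. As a minimal sketch of how such metrics are commonly computed for a binary outcome (1 = had ARI, 0 = no ARI), the following uses only the Python standard library. The function names, the 0.5 threshold, and the toy labels and scores are illustrative assumptions; they are not the study's code or data.

```python
from typing import Dict, Sequence


def summary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, PPV and NPV from binary labels (1 = had ARI, 0 = no ARI)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    return {
        "accuracy": (tp + tn) / len(pairs),
        # PPV: share of predicted positives that are truly positive.
        "ppv": tp / (tp + fp) if (tp + fp) else float("nan"),
        # NPV: share of predicted negatives that are truly negative.
        "npv": tn / (tn + fn) if (tn + fn) else float("nan"),
    }


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (hypothetical values, not from the EDHS data).
toy_labels = [1, 1, 1, 0, 0, 0, 0, 1]
toy_scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.1, 0.7]
toy_preds = [1 if s >= 0.5 else 0 for s in toy_scores]  # assumed 0.5 cut-off

print(summary_metrics(toy_labels, toy_preds))
print(roc_auc(toy_labels, toy_scores))
```

Accuracy, PPV, and NPV depend on the chosen classification threshold, whereas the AUC summarizes ranking quality across all thresholds, which is one reason the two kinds of metric are usually reported together.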

Conflict of interest statement

Declarations

Consent for publication: Not applicable.

Competing interests: The authors declare no competing interests.

Ethical approval and consent to participate: The researchers received the survey data approval letter from the USAID DHS program after registering at https://www.dhsprogram.com/data/dataset_admin/login_main.cfm, and they maintained the confidentiality and privacy of the data. An authorization letter to use the data was obtained from ICF and is attached as an annex. The study did not require ethical approval because it was a secondary data analysis of the 2016 EDHS dataset. After receiving the data from the USAID–DHS program, the researchers maintained the data's anonymity. During the survey, informed consent was obtained from the study participants before the study began. All methods were carried out in accordance with relevant guidelines and regulations.

Figures

Fig. 1. Feature selection using Boruta algorithm.
Fig. 2. ROC curve for the seven models.
Fig. 3. SHAP global importance plot of the optimized Random Forest model (Class 0 = no ARI; Class 1 = had ARI).
Fig. 4. Beeswarm plot, ranked by mean absolute SHAP value, generated by the optimized Random Forest model.
Fig. 5. Bar plot of global Shapley values generated by the optimized Random Forest model.
Fig. 6. Waterfall plot displaying the prediction for an ARI-positive observation.
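
Figs. 3 to 6 are SHAP plots, which attribute a single prediction to individual features using Shapley values. The sketch below computes exact Shapley values for a tiny hypothetical risk function by averaging each feature's marginal contribution over all feature orderings. It is only meant to illustrate the idea: the function `toy_risk`, its coefficients, the feature names, and the all-zero baseline are assumptions made for illustration, and tree-based SHAP implementations use a more efficient algorithm and a background dataset rather than a single baseline.

```python
from itertools import permutations
from typing import Callable, Dict

Features = Dict[str, float]


def shapley_values(
    model: Callable[[Features], float],
    instance: Features,
    baseline: Features,
) -> Dict[str, float]:
    """Exact Shapley values relative to a single baseline observation.

    Features not yet "switched on" keep their baseline value; each ordering
    switches features to the instance's values one at a time and records the
    change in model output.
    """
    names = list(instance)
    totals = {name: 0.0 for name in names}
    orderings = list(permutations(names))
    for order in orderings:
        current = dict(baseline)
        previous = model(current)
        for name in order:
            current[name] = instance[name]
            updated = model(current)
            totals[name] += updated - previous  # marginal contribution
            previous = updated
    return {name: total / len(orderings) for name, total in totals.items()}


def toy_risk(x: Features) -> float:
    """Hypothetical risk score with one interaction term (not a fitted model)."""
    return (
        0.10
        + 0.20 * x["poorer_wealth"]
        + 0.10 * x["many_children"]
        + 0.08 * x["poorer_wealth"] * x["wood_fuel"]
    )


instance = {"poorer_wealth": 1, "many_children": 1, "wood_fuel": 1}
baseline = {"poorer_wealth": 0, "many_children": 0, "wood_fuel": 0}
phi = shapley_values(toy_risk, instance, baseline)

# The contributions sum to model(instance) - model(baseline).
for name, value in phi.items():
    print(f"{name}: {value:.3f}")
```

In this toy case the interaction between `poorer_wealth` and `wood_fuel` is split equally between the two features. A waterfall plot such as Fig. 6 displays per-feature contributions of this kind for one observation, while the global plots aggregate their absolute values across observations.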
