Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 11;19(4):e0300172.
doi: 10.1371/journal.pone.0300172. eCollection 2024.

Application of machine learning methods for predicting childhood anaemia: Analysis of Ethiopian Demographic Health Survey of 2016

Affiliations

Application of machine learning methods for predicting childhood anaemia: Analysis of Ethiopian Demographic Health Survey of 2016

Solomon Hailemariam Tesfaye et al. PLoS One. .

Abstract

Childhood anaemia is a public health problem in Ethiopia. Machine learning (ML) is a growing in medicine field to predict diseases. Diagnosis of childhood anaemia is resource intensive. The aim of this study is to apply machine learning (ML) algorithm to predict childhood anaemia using socio-demographic, economic, and maternal and child related variables. The study used data from 2016 Ethiopian demographic health survey (EDHS). We used Python software version 3.11 to apply and test ML algorithms through logistic regression, Random Forest (RF), Decision Tree, and K-Nearest Neighbours (KNN). We evaluated the performance of each of the ML algorithms using discrimination and calibration parameters. The predictive performance of the algorithms was between 60% and 66%. The logistic regression model was the best predictive model of ML with accuracy (66%), sensitivity (82%), specificity (42%), and AUC (69%), followed by RF with accuracy (64%), sensitivity (79%), specificity (42%), and AUC (63%). The logistic regression and the RF models of ML showed poorest family, child age category between 6 and 23 months, uneducated mother, unemployed mother, and stunting as high importance predictors of childhood anaemia. Applying logistic regression and RF models of ML can detect combinations of predictors of childhood anaemia that can be used in primary health care professionals.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. ROC Curves for the four models.
Fig 2
Fig 2. Calibration plots for the four models.
Fig 3
Fig 3. Variable importance measures for the logistic regression algorithm.
Fig 4
Fig 4. Variable importance for the random forest algorithm.

Similar articles

Cited by

References

    1. Guyton AC, Hall JE. Text Book of Medical Physiology, 14th edition. Jackson, Mississippi, USA: Elsevier Inc.; 2006; 11.
    1. Nutritional anaemias: tools for effective prevention and control. Geneva: World Health Organization; 2017. Licence: CC BY-NC-SA 3.0 IGO. Available from http://apps.who.int/iris/bitstream/handle/10665/259425/9789241513067-eng.... Acessed [02 March, 2023].
    1. Kassebaum NJ. The Global Burden of Anemia. Hematology/oncology clinics of North America 2016;30(2):247–308. doi: 10.1016/j.hoc.2015.11.002 - DOI - PubMed
    1. Roberts DJ, Matthews G, Snow RW, Zewotir T, Sartorius B. Investigating the spatial variation and risk factors of childhood anaemia in four sub-Saharan African countries. BMC public health 2020;20(1):126. doi: 10.1186/s12889-020-8189-8 - DOI - PMC - PubMed
    1. Petry N, Olofin I, Hurrell RF, Boy E, Wirth JP, Moursi M, et al.. The Proportion of Anemia Associated with Iron Deficiency in Low, Medium, and High Human Development Index Countries: A Systematic Analysis of National Surveys. Nutrients 2016;8(11). doi: 10.3390/nu8110693 - DOI - PMC - PubMed