Machine Learning Algorithms for understanding the determinants of under-five Mortality

BioData Min. 2022 Sep 24;15(1):20. doi: 10.1186/s13040-022-00308-8.

Rakesh Kumar Saroj¹, Pawan Kumar Yadav², Rajneesh Singh³, Obvious N Chilyabanyama⁴,⁵

Affiliations

¹ Department of Community Medicine, Sikkim Manipal Institute of Medical Sciences-Sikkim Manipal University, Gangtok, Sikkim, 737102, India. rakesh.saroj@bhu.ac.in.
² Department of Biostatistics and Epidemiology, International Institute for Population Sciences, Mumbai, 400088, India.
³ Department of Mathematics and Statistics, Banasthali Vidyapith, Vanasthali Rd, Aliyabad, Tonk, Rajasthan, 304022, India.
⁴ Centre for Infectious Disease Research in Zambia, Lusaka, Zambia.
⁵ African Centre of Excellency in Data Science (ACEDS), University of Rwanda, KK 737 Street, Gikondo, Kigali, Rwanda.

PMID: 36153553
PMCID: PMC9509654
DOI: 10.1186/s13040-022-00308-8

Rakesh Kumar Saroj et al. BioData Min. 2022.

Abstract

Background: Under-five mortality is a serious concern for child health and for the social development of any country. This paper aimed to assess the accuracy of machine learning models in predicting under-five mortality and to identify the most significant factors associated with it.

Method: The data were taken from the National Family Health Survey (NFHS-IV) of Uttar Pradesh. First, we used multivariate logistic regression because of its capability to identify important factors; we then applied machine learning techniques, namely decision tree, random forest, Naïve Bayes, K-nearest neighbor (KNN), logistic regression, support vector machine (SVM), neural network, and ridge classifier. Each model's performance was assessed using the confusion matrix, accuracy, precision, recall, F1 score, Cohen's Kappa, and area under the receiver operating characteristic curve (AUROC). Information gain rank was used to find the important factors for under-five mortality. Data analysis was performed using STATA-16.0, Python 3.3, and IBM SPSS Statistics for Windows, Version 27.0.
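As a minimal sketch of how the confusion-matrix metrics named above relate to one another, the following computes accuracy, precision, recall, F1 score, and Cohen's Kappa from the four cell counts of a binary confusion matrix. The function name `classification_metrics` and the counts are hypothetical, chosen only for illustration; they are not taken from the paper and this is not the authors' code.

```python
def classification_metrics(tp, fp, fn, tn):
    """Summary metrics from binary confusion-matrix counts.

    tp/fp/fn/tn are true positives, false positives,
    false negatives, and true negatives.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0

    # Cohen's Kappa: observed agreement (accuracy) corrected for the
    # agreement expected by chance from the row and column totals.
    p_pos = ((tp + fp) / total) * ((tp + fn) / total)
    p_neg = ((fn + tn) / total) * ((fp + tn) / total)
    p_chance = p_pos + p_neg
    kappa = (accuracy - p_chance) / (1 - p_chance) if p_chance != 1 else 0.0

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "kappa": kappa,
    }


# Hypothetical counts for a rare outcome (50 positives in 1000 records).
metrics = classification_metrics(tp=40, fp=60, fn=10, tn=890)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

With a rare positive class, accuracy can stay high while precision remains modest, which is one reason Kappa and F1 are usually reported alongside accuracy.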

Result: The neural network was the best predictive model for under-five mortality compared with the other models, with accuracy of 95.29% to 95.96%, recall of 71.51% to 81.03%, precision of 36.64% to 51.83%, F1 score of 50.46% to 62.68%, Cohen's Kappa of 0.48 to 0.60, AUROC of 93.51% to 96.22%, and a precision-recall curve range of 99.52% to 99.73%. Logistic regression also performed well in predicting under-five mortality, with accuracy of 94% to 95%, AUROC of 93.4% to 94.8%, and a precision-recall curve range of 99.5% to 99.6%. The number of living children, survival time, wealth index, child size at birth, birth in the last five years, the total number of children ever born, mother's education level, and birth order were identified as important factors influencing under-five mortality.
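The factor ranking reported above is based on information gain. A minimal sketch of that idea for categorical features follows: the gain of a feature is the entropy of the outcome minus the outcome's entropy after splitting on that feature. The helper names (`entropy`, `information_gain`, `rank_features`), the feature names, and the toy records are all assumptions made for illustration; the paper's actual implementation and data are not shown in the abstract.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their entropy conditional on the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(table, labels):
    """Return (feature, gain) pairs sorted from most to least informative."""
    gains = {name: information_gain(col, labels) for name, col in table.items()}
    return sorted(gains.items(), key=lambda item: item[1], reverse=True)


# Toy records invented for illustration: 1 = died before age five, 0 = survived.
labels = [1, 1, 0, 0, 0, 0, 0, 0]
table = {
    # Hypothetical feature that separates the two outcomes perfectly.
    "small_at_birth": ["yes", "yes", "no", "no", "no", "no", "no", "no"],
    # Hypothetical feature with the same outcome mix in both groups.
    "birth_order": ["first", "later", "first", "first",
                    "first", "later", "later", "later"],
}

for name, gain in rank_features(table, labels):
    print(f"{name}: {gain:.4f}")
```

Continuous variables would need to be binned before this calculation; how the study handled them is not stated in the abstract.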

Conclusion: The neural network was a better predictive model than the other machine learning models for under-five mortality, but logistic regression also showed good results. These models may be helpful for the analysis of high-dimensional data in health research.

Keywords: Accuracy; Machine learning; Neural Network; Random Forest; Under-five mortality.

Conflict of interest statement

The authors declared that they have no competing interests.

Figures

**Fig. 1**
State-wise comparison of under-five mortality, with Uttar Pradesh highlighted against other states (NFHS-4)

**Fig. 2**
Overview of the proposed framework of machine learning for under-five child mortality data

**Fig. 3**
ROC curve for machine learning models in predicting under-five mortality with all factors (70/30 Ratio)

**Fig. 4**
Precision-Recall curves for machine learning models in predicting under-five mortality with all factors (70/30 Ratio)

**Fig. 5**
ROC curve for machine learning models in predicting under-five mortality with all factors (80/20 Ratio)

**Fig. 6**
Precision-Recall curve for machine learning models in predicting under-five mortality with all factors (80/20 Ratio)

**Fig. 7**
Information gain rank values of the variables under study

**Fig. 8**
ROC curve for machine learning models in predicting under-five mortality with important factors (70/30 Ratio)

**Fig. 9**
Precision-Recall curve for machine learning models in predicting under-five mortality with important factors (70/30 Ratio)

**Fig. 10**
ROC curve for machine learning models in predicting under-five mortality with important factors (80/20 Ratio)

**Fig. 11**
Precision-Recall curve for machine learning models in predicting under-five mortality with important factors (80/20 Ratio)

Cited by

Multi-parametric MRI-based machine learning model for prediction of pathological grade of renal injury in a rat kidney cold ischemia-reperfusion injury model.
Chen L, Ren Y, Yuan Y, Xu J, Wen B, Xie S, Zhu J, Li W, Gong X, Shen W. BMC Med Imaging. 2024 Jul 26;24(1):188. doi: 10.1186/s12880-024-01320-6. PMID: 39060984
Subnational estimates of life expectancy at birth in India: evidence from NFHS and SRS data.
Yadav PK, Yadav S. BMC Public Health. 2024 Apr 16;24(1):1058. doi: 10.1186/s12889-024-18278-3. PMID: 38627658
Machine learning-based models for prediction of the risk of stroke in coronary artery disease patients receiving coronary revascularization.
Lin L, Ding L, Fu Z, Zhang L. PLoS One. 2024 Feb 8;19(2):e0296402. doi: 10.1371/journal.pone.0296402. PMID: 38330052
Predictors of micronutrient deficiency among children aged 6-23 months in Ethiopia: a machine learning approach.
Gebeye LG, Dessie EY, Yimam JA. Front Nutr. 2024 Jan 5;10:1277048. doi: 10.3389/fnut.2023.1277048. PMID: 38249594
Prediction of incomplete immunization among under-five children in East Africa from recent demographic and health surveys: a machine learning approach.
Tadese ZB, Nigatu AM, Yehuala TZ, Sebastian Y. Sci Rep. 2024 May 21;14(1):11529. doi: 10.1038/s41598-024-62641-8. PMID: 38773175
