Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 12;6(3):506-20.
doi: 10.4338/ACI-2015-03-RA-0036. eCollection 2015.

Machine Learning Techniques for Prediction of Early Childhood Obesity

Affiliations

Machine Learning Techniques for Prediction of Early Childhood Obesity

T M Dugan et al. Appl Clin Inform. .

Abstract

Objectives: This paper aims to predict childhood obesity after age two, using only data collected prior to the second birthday by a clinical decision support system called CHICA.

Methods: Analyses of six different machine learning methods: RandomTree, RandomForest, J48, ID3, Naïve Bayes, and Bayes trained on CHICA data show that an accurate, sensitive model can be created.

Results: Of the methods analyzed, the ID3 model trained on the CHICA dataset proved the best overall performance with accuracy of 85% and sensitivity of 89%. Additionally, the ID3 model had a positive predictive value of 84% and a negative predictive value of 88%. The structure of the tree also gives insight into the strongest predictors of future obesity in children. Many of the strongest predictors seen in the ID3 modeling of the CHICA dataset have been independently validated in the literature as correlated with obesity, thereby supporting the validity of the model.

Conclusions: This study demonstrated that data from a production clinical decision support system can be used to build an accurate machine learning model to predict obesity in children after age two.

Keywords: Bayes theorem; Obesity; artificial intelligence; decision trees; predictive analytics.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
CHICA 20 question survey
Fig. 2
Fig. 2
Physician worksheet
Fig. 3
Fig. 3
Attributes contained in the first three levels of the tree.
Fig. 4
Fig. 4
Relationship between tree depth and accurancy.
Fig. 5
Fig. 5
Relationship between tree depth and sensitivity.
Fig. 6
Fig. 6
Relationship between tree depth and area under the ROC curve.

References

    1. CDC. [12/3/2013]. Available from: http://www.cdc.gov/healthyyouth/obesity/facts.htm.
    1. Khan NA, Raine LB, Drollette ES, Scudder MR, Pontifex MB, Castelli DM, Donovan SM, Evans EM, Hillman CH. Impact of the FITKids Physical Activity Intervention on Adiposity in Prepubertal Children. Pediatrics 2014; 133: e875–e883. - PMC - PubMed
    1. Mollard RC, Senechal M, MacIntosh AC, Hay J, Wicklow BA, Wittmeier KD, Sellers EA, Dean HJ, Ryner L, Berard L, McGavock JM. Dietary determinants of hepatic steatosis and visceral adiposity in overweight and obese youth at risk of type 2 diabetes. The American journal of clinical nutrition 2014; 99: 804–812. - PubMed
    1. Wing RR, Bolin P, Brancati FL, Bray GA, Clark JM, Coday M, Crow RS, Curtis JM, Egan CM, Espeland MA, Evans M, Foreyt JP, Ghazarian S, Gregg EW, Harrison B, Hazuda HP, Hill JO, Horton ES, Hubbard VS, Jakicic JM, Jeffery RW, Johnson KC, Kahn SE, Kitabchi AE, Knowler WC, Lewis CE, Maschak-Carey BJ, Montez MG, Murillo A, Nathan DM, Patricio J, Peters A, Pi-Sunyer X, Pownall H, Reboussin D, Regen-steiner JG, Rickman AD, Ryan DH, Safford M, Wadden TA, Wagenknecht LE, West DS, Williamson DF, Yanovski SZ. Cardiovascular effects of intensive lifestyle intervention in type 2 diabetes. New England journal of medicine 2013; 369: 145–154. - PMC - PubMed
    1. Wadden TA, Volger S, Sarwer DB, Vetter ML, Tsai AG, Berkowitz RI, Kumanyika S, Schmitz KH, Diewald LK, Barg R, Chittams J, Moore RH. A two-year randomized trial of obesity treatment in primary care practice. New England journal of medicine 2011; 365: 1969–1979. - PMC - PubMed

Publication types