Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 31;14(8):816.
doi: 10.3390/jpm14080816.

Combination of Machine Learning Techniques to Predict Overweight/Obesity in Adults

Affiliations

Combination of Machine Learning Techniques to Predict Overweight/Obesity in Adults

Alberto Gutiérrez-Gallego et al. J Pers Med. .

Abstract

(1) Background: Artificial intelligence using machine learning techniques may help us to predict and prevent obesity. The aim was to design an interpretable prediction algorithm for overweight/obesity risk based on a combination of different machine learning techniques. (2) Methods: 38 variables related to sociodemographic, lifestyle, and health aspects from 1179 residents in Madrid were collected and used to train predictive models. Accuracy, precision, and recall metrics were tested and compared between nine classical machine learning techniques and the predictive model based on a combination of those classical machine learning techniques. Statistical validation was performed. The shapely additive explanation technique was used to identify the variables with the greatest impact on weight gain. (3) Results: Cascade classifier model combining gradient boosting, random forest, and logistic regression models showed the best predictive results for overweight/obesity compared to all machine learning techniques tested, reaching an accuracy of 79%, precision of 84%, and recall of 89% for predictions for weight gain. Age, sex, academic level, profession, smoking habits, wine consumption, and Mediterranean diet adherence had the highest impact on predicting obesity. (4) Conclusions: A combination of machine learning techniques showed a significant improvement in accuracy to predict risk of overweight/obesity than machine learning techniques separately.

Keywords: artificial intelligence; machine learning; overweight/obesity; predictive model.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Cascade classifier operation flow. Basis structure of proposed cascade classifier, combining gradient boosting (classifier 1), random forest (classifier 2), and logistic regression (classifier 3) machine learning techniques.
Figure 2
Figure 2
Bayesian ACC and density ACC using instances. Probability of wining for each classical machine learning technique separately and the cascade model. Panel (A): Results represented as Bayesian average coverage criterion. Panel (B): Results represented as density average coverage criterion.
Figure 2
Figure 2
Bayesian ACC and density ACC using instances. Probability of wining for each classical machine learning technique separately and the cascade model. Panel (A): Results represented as Bayesian average coverage criterion. Panel (B): Results represented as density average coverage criterion.
Figure 3
Figure 3
SHAP values. The model´s interpretation. Panel (A): Feature importance plot (after feature normalization). Panel (B): SHAP summary plots of the cascade flow model. Each row in the SHAP summary plot represents a feature, with the corresponding Shap values displayed along the x-axis. The features are ranked according to their average absolute Shap values, which represent the most important features of the model. Each point in the plot corresponds to a sample, with the color indicating the magnitude of the feature value, where red denotes larger values and blue denotes smaller values. Panel (C): Waterfall plot for explaining an individual´s prediction results in the validation cohort. The y axis shows the name of the variables and the x axis shows the Shap value. The red bar shows the positive contribution of the feature to the predicted value, and the blue bar shows the negative contribution of the feature to the predicted value.
Figure 3
Figure 3
SHAP values. The model´s interpretation. Panel (A): Feature importance plot (after feature normalization). Panel (B): SHAP summary plots of the cascade flow model. Each row in the SHAP summary plot represents a feature, with the corresponding Shap values displayed along the x-axis. The features are ranked according to their average absolute Shap values, which represent the most important features of the model. Each point in the plot corresponds to a sample, with the color indicating the magnitude of the feature value, where red denotes larger values and blue denotes smaller values. Panel (C): Waterfall plot for explaining an individual´s prediction results in the validation cohort. The y axis shows the name of the variables and the x axis shows the Shap value. The red bar shows the positive contribution of the feature to the predicted value, and the blue bar shows the negative contribution of the feature to the predicted value.

Similar articles

References

    1. Consumer Affairs Ministry of Health and Social Welfare. National Health Survey. Spain. 2017. [(accessed on 15 January 2021)]. Available online: https://www.sanidad.gob.es/estadEstudios/estadisticas/encuestaNacional/e.... (In Spanish)
    1. Fruh S.M. Obesity: Risk factors, complications, and strategies for sustainable long-term weight management. J. Am. Assoc. Nurse Pract. 2017;29:S3–S14. doi: 10.1002/2327-6924.12510. - DOI - PMC - PubMed
    1. WHO Obesity and Overweight. 2024. [(accessed on 15 July 2024)]. Available online: https://www.who.int/news-room/fact-sheets/detail/obesity-and-overweight.
    1. Telleria-Aramburu N., Arroyo-Izaga M. Risk factors of overweight/obesity-related lifestyles in university students: Results from the EHU12/24 study. Br. J. Nutr. 2022;127:914–926. doi: 10.1017/S0007114521001483. - DOI - PMC - PubMed
    1. Cheadle A., Atiedu A., Rauzon S., Schwartz P.M., Keene L., Davoudi M., Spring R., Molina M., Lee L., Boyle K., et al. A Community-Level Initiative to Prevent Obesity: Results from Kaiser Permanente’s Healthy Eating Active Living Zones Initiative in California. Am. J. Prev. Med. 2018;54((Suppl. S2)):S150–S159. doi: 10.1016/j.amepre.2018.01.024. - DOI - PubMed

LinkOut - more resources