Visual nutrition analysis: leveraging segmentation and regression for food nutrient estimation
- PMID: 39742105
- PMCID: PMC11685081
- DOI: 10.3389/fnut.2024.1469878
Visual nutrition analysis: leveraging segmentation and regression for food nutrient estimation
Abstract
Introduction: Nutrition is closely related to body health. A reasonable diet structure not only meets the body's needs for various nutrients but also effectively prevents many chronic diseases. However, due to the general lack of systematic nutritional knowledge, people often find it difficult to accurately assess the nutritional content of food. In this context, image-based nutritional evaluation technology can provide significant assistance. Therefore, we are dedicated to directly predicting the nutritional content of dishes through images. Currently, most related research focuses on estimating the volume or area of food through image segmentation tasks and then calculating its nutritional content based on the food category. However, this method often lacks real nutritional content labels as a reference, making it difficult to ensure the accuracy of the predictions.
Methods: To address this issue, we combined segmentation and regression tasks and used the Nutrition5k dataset, which contains detailed nutritional content labels but no segmentation labels, for manual segmentation annotation. Based on these annotated data, we developed a nutritional content prediction model that performs segmentation first and regression afterward. Specifically, we first applied the UNet model to segment the food, then used a backbone network to extract features, and enhanced the feature expression capability through the Squeeze-and-Excitation structure. Finally, the extracted features were processed through several fully connected layers to obtain predictions for the weight, calories, fat, carbohydrates, and protein content.
Results and discussion: Our model achieved an outstanding average percentage mean absolute error (PMAE) of 17.06% for these components. All manually annotated segmentation labels can be found at https://doi.org/10.6084/m9.figshare.26252048.v1.
Keywords: Nutrition5k; deep learning; image segmentation; nutrition estimation; regression.
Copyright © 2024 Zhao, Zhu, Jiang and Xia.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures









Similar articles
-
Rapid Non-Destructive Analysis of Food Nutrient Content Using Swin-Nutrition.Foods. 2022 Oct 29;11(21):3429. doi: 10.3390/foods11213429. Foods. 2022. PMID: 36360043 Free PMC article.
-
Tailoring the Nutritional Composition of Italian Foods to the US Nutrition5k Dataset for Food Image Recognition: Challenges and a Comparative Analysis.Nutrients. 2024 Oct 1;16(19):3339. doi: 10.3390/nu16193339. Nutrients. 2024. PMID: 39408306 Free PMC article.
-
Automatic segmentation of pericardial adipose tissue from cardiac MR images via semi-supervised method with difference-guided consistency.Med Phys. 2025 Mar;52(3):1679-1692. doi: 10.1002/mp.17558. Epub 2024 Dec 5. Med Phys. 2025. PMID: 39636531
-
mid-DeepLabv3+: A Novel Approach for Image Semantic Segmentation Applied to African Food Dietary Assessments.Sensors (Basel). 2023 Dec 29;24(1):209. doi: 10.3390/s24010209. Sensors (Basel). 2023. PMID: 38203070 Free PMC article.
-
Annotation-efficient training of medical image segmentation network based on scribble guidance in difficult areas.Int J Comput Assist Radiol Surg. 2024 Jan;19(1):87-96. doi: 10.1007/s11548-023-02931-0. Epub 2023 May 26. Int J Comput Assist Radiol Surg. 2024. PMID: 37233894
Cited by
-
2D Prediction of the Nutritional Composition of Dishes from Food Images: Deep Learning Algorithm Selection and Data Curation Beyond the Nutrition5k Project.Nutrients. 2025 Jun 30;17(13):2196. doi: 10.3390/nu17132196. Nutrients. 2025. PMID: 40647299 Free PMC article.
References
-
- Jiang L, Qiu B, Liu X, Huang C, Lin K. Deepfood: Food image analysis and dietary assessment via deep model. IEEE Access. (2020) 8:47477–89. 10.1109/ACCESS.2020.2973625 - DOI
-
- Situju SF, Takimoto H, Sato S, Yamauchi H, Kanagawa A, Lawi A. Food constituent estimation for lifestyle disease prevention by multi-task cnn. Appl Artif Intellig. (2019) 33:732–46. 10.1080/08839514.2019.1602318 - DOI
Associated data
LinkOut - more resources
Full Text Sources