Sensors (Basel). 2023 Dec 29;24(1):209.

doi: 10.3390/s24010209.

mid-DeepLabv3+: A Novel Approach for Image Semantic Segmentation Applied to African Food Dietary Assessments

Thierry Roland Baban A Erep¹, Lotfi Chaari¹

PMID: 38203070
PMCID: PMC10781344
DOI: 10.3390/s24010209

Thierry Roland Baban A Erep et al. Sensors (Basel). 2023.

Affiliation

¹ Toulouse INP, University of Toulouse, Institut de Recherche en Informatique de Toulouse, 31400 Toulouse, France.

Abstract

Recent decades have witnessed the development of vision-based dietary assessment (VBDA) systems. These systems generally consist of three main stages: food image analysis, portion estimation, and nutrient derivation. The effectiveness of the initial step is highly dependent on the use of accurate segmentation and image recognition models and the availability of high-quality training datasets. Food image segmentation still faces various challenges, and most existing research focuses mainly on Asian and Western food images. For this reason, this study is based on food images from sub-Saharan Africa, which pose their own problems, such as inter-class similarity and dishes with mixed-class food. This work focuses on the first stage of VBDAs, where we introduce two notable contributions. Firstly, we propose mid-DeepLabv3+, an enhanced food image segmentation model based on DeepLabv3+ with a ResNet50 backbone. Our approach involves adding a middle layer in the decoder path and SimAM after each extracted backbone feature layer. Secondly, we present CamerFood10, the first food image dataset specifically designed for sub-Saharan African food segmentation. It includes 10 classes of the most consumed food items in Cameroon. On our dataset, mid-DeepLabv3+ outperforms benchmark convolutional neural network models for semantic image segmentation, with an mIoU (mean Intersection over Union) of 65.20%, representing a +10.74% improvement over DeepLabv3+ with the same backbone.
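The mIoU figure quoted in the abstract can be illustrated with a minimal sketch of the usual definition: per-class intersection over union, averaged over classes. The abstract does not state the authors' exact averaging convention (for example, whether background is counted, or whether the score is computed per image or over the whole dataset), so the function name `mean_iou`, the flattened label layout, and the choice to skip classes absent from both masks are assumptions made for illustration.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean Intersection over Union for two flattened label maps.

    Assumption: classes that appear in neither mask are skipped
    rather than counted as IoU = 0 or IoU = 1.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of pixels")

    intersection = [0] * num_classes
    pred_count = [0] * num_classes
    target_count = [0] * num_classes

    # Count per-class pixels in each mask and the pixels where both agree.
    for p, t in zip(pred, target):
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:
            ious.append(intersection[c] / union)

    return sum(ious) / len(ious) if ious else 0.0


# Toy 4-pixel example with two classes:
# class 0 has IoU 1/2, class 1 has IoU 2/3, so the mean is 7/12.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

In practice the same counts are usually accumulated in a confusion matrix over the whole test set before the per-class ratios are taken.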

Keywords: CNN; CamerFood10 dataset; food segmentation; semantic segmentation.

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

**Figure 1**
Different kinds of Cameroonian food with a similar yellow texture.

**Figure 2**
Some images in the CamerFood10 dataset with mask overlay.

**Figure 3**
CamerFood10 class occurrence distribution.

**Figure 4**
Size distribution of the CamerFood10 masks for each class, based on the share of image pixels each mask occupies: small objects cover < 5% of the image, medium objects between 5% and 20%, and large objects > 20%.

**Figure 5**
Architecture of our proposed model: mid-DeepLabv3+.

**Figure 6**
Feature extraction backbone of mid-DeepLabv3+, based on a scaled-down version of the ResNet50 architecture: the ResNet50 model without its fifth convolution block (Conv5).

**Figure 7**
Several images from the CamerFood10 dataset with their ground-truth masks and the predictions of mid-DeepLabv3+ and other benchmark models. For reasons of space, only the predictions of the best-performing models are shown.
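The size categories given in the Figure 4 caption (small, medium, large) can be expressed as a short helper. This is only a sketch of the thresholds stated in the caption; the function name `size_category` is hypothetical, and treating exactly 5% and exactly 20% as "medium" is an assumption that follows from the caption's strict inequalities for small and large objects.

```python
def size_category(mask_pixels: int, image_pixels: int) -> str:
    """Bucket a mask by the share of the image it covers.

    Thresholds follow the Figure 4 caption: small < 5%,
    medium between 5% and 20%, large > 20%.
    Assumption: the boundaries 5% and 20% both count as medium.
    """
    if image_pixels <= 0:
        raise ValueError("image_pixels must be positive")
    if not 0 <= mask_pixels <= image_pixels:
        raise ValueError("mask_pixels must lie between 0 and image_pixels")

    # Integer arithmetic avoids floating-point issues at the boundaries.
    if mask_pixels * 100 < 5 * image_pixels:
        return "small"
    if mask_pixels * 100 <= 20 * image_pixels:
        return "medium"
    return "large"


# A mask covering 12,000 pixels of a 400 x 300 image occupies 10% of it.
category = size_category(12_000, 400 * 300)
```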

Cited by

Lightweight DeepLabv3+ for Semantic Food Segmentation.
Muñoz B, Martínez-Arroyo A, Acevedo C, Aguilar E. Foods. 2025 Apr 9;14(8):1306. doi: 10.3390/foods14081306. PMID: 40282708. Free PMC article.
