J Diabetes Sci Technol. 2015 May;9(3):525-33. doi: 10.1177/1932296815582222. Epub 2015 Apr 21.

"Snap-n-Eat": Food Recognition and Nutrition Estimation on a Smartphone

Affiliations

"Snap-n-Eat": Food Recognition and Nutrition Estimation on a Smartphone

Weiyu Zhang et al. J Diabetes Sci Technol. 2015 May.

Abstract

We present Snap-n-Eat, a mobile food recognition system. The system recognizes food and estimates its caloric and nutritional content automatically, without any user intervention. To identify food items, the user simply snaps a photo of the food plate. The system detects the salient region, crops the image, and subtracts the background accordingly. Hierarchical segmentation is performed to divide the image into regions. We then extract features at different locations and scales and classify these regions into different kinds of food using a linear support vector machine classifier. In addition, the system determines the portion size, which is then used to estimate the caloric and nutritional content of the food on the plate. Previous approaches have mostly worked with images captured in a lab setting, or they have required additional user input (e.g., user-drawn bounding boxes). Our system achieves automatic food detection and recognition in real-life settings with cluttered backgrounds. When multiple food items appear in an image, our system identifies them and estimates their portion sizes simultaneously. We implemented the system as both an Android smartphone application and a web service. In our experiments, we achieved above 85% accuracy when detecting 15 different kinds of food.
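
A minimal sketch of the kind of pipeline the abstract describes (saliency detection, segmentation, per-region classification, area-based portion size), using OpenCV, scikit-image, and scikit-learn. This is not the authors' implementation: recognize_plate and feature_fn are illustrative names, the thresholds are assumptions, and cv2.saliency ships with the opencv-contrib-python package.

    # Sketch of a snap-to-nutrition pipeline: saliency -> segmentation ->
    # per-region classification. Illustrative only, not the authors' code.
    import cv2
    import numpy as np
    from skimage.segmentation import felzenszwalb

    def recognize_plate(image_bgr, classifier, feature_fn):
        # 1. Find the salient (food) region and build a background mask.
        sal = cv2.saliency.StaticSaliencySpectralResidual_create()
        _, sal_map = sal.computeSaliency(image_bgr)
        sal_u8 = (sal_map * 255).astype(np.uint8)
        _, mask = cv2.threshold(sal_u8, 0, 255,
                                cv2.THRESH_BINARY + cv2.THRESH_OTSU)

        # 2. Segment the image into candidate food regions.
        segments = felzenszwalb(image_bgr, scale=300, sigma=0.8, min_size=500)

        # 3. Classify each sufficiently large, salient region; its pixel
        #    area serves as a crude proxy for portion size.
        results = []
        for seg_id in np.unique(segments):
            region = (segments == seg_id) & (mask > 0)
            if region.sum() < 1000:
                continue
            feats = feature_fn(image_bgr, region)   # e.g., HOG + dense SIFT
            label = classifier.predict([feats])[0]  # linear SVM (LinearSVC)
            results.append((label, int(region.sum())))
        return results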

Keywords: food recognition; mobile food recognition; nutrition estimation; visual food recognition.

Conflict of interest statement

Declaration of Conflicting Interests: The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure 1.
Overview of proposed approach.
Figure 2.
We computed the low-level features inside labeled food items. The top 2 images show the embedding of the HOG feature. The bottom 2 images show the embedding of the SIFT feature at scales of 16 × 16 and 32 × 32. Each colored dot represents 1 labeled food instance.
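
As a rough sketch of how HOG plus dense SIFT at two scales, like the features visualized here, can be computed with scikit-image and OpenCV. The average pooling and all parameters below are assumptions, not taken from the paper:

    import cv2
    import numpy as np
    from skimage.feature import hog

    def region_features(gray_patch, sift_sizes=(16, 32)):
        # HOG over a fixed-size resize of the region crop.
        hog_vec = hog(cv2.resize(gray_patch, (64, 64)),
                      orientations=9, pixels_per_cell=(8, 8),
                      cells_per_block=(2, 2))

        # Dense SIFT at 16x16 and 32x32, average-pooled per scale.
        sift = cv2.SIFT_create()
        pooled = []
        for size in sift_sizes:
            step = size // 2
            kps = [cv2.KeyPoint(float(x), float(y), float(size))
                   for y in range(step, gray_patch.shape[0], step)
                   for x in range(step, gray_patch.shape[1], step)]
            _, desc = sift.compute(gray_patch, kps)
            pooled.append(desc.mean(axis=0) if desc is not None
                          else np.zeros(128, np.float32))
        return np.concatenate([hog_vec, *pooled])
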
Figure 3.
Saliency computation. The left image shows a picture of a food plate snapped by a user. The center image shows the saliency map, with red/yellow colors indicating high saliency and blue colors indicating low saliency. The right image shows the binary saliency map obtained by thresholding the center image, which can be used to identify the location of food in the image. Note that the binary saliency map picks out the most important regions of the image.
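
The abstract does not name the saliency algorithm; one classic choice that produces maps like the center image is the spectral residual method (Hou and Zhang, 2007). A sketch in NumPy/OpenCV, with the threshold rule below an assumption:

    import cv2
    import numpy as np

    def spectral_residual_saliency(gray, thresh_ratio=3.0):
        # Spectral residual: subtract the smoothed log-amplitude spectrum,
        # keep the phase, and transform back to image space.
        f = np.fft.fft2(gray.astype(np.float64))
        log_amp = np.log(np.abs(f) + 1e-8)
        residual = log_amp - cv2.blur(log_amp, (3, 3))
        sal = np.abs(np.fft.ifft2(np.exp(residual + 1j * np.angle(f)))) ** 2
        sal = cv2.GaussianBlur(sal, (11, 11), 2.5)
        sal /= sal.max()

        # Binary saliency map: keep pixels well above the mean saliency.
        return sal, sal > thresh_ratio * sal.mean()
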
Figure 4.
Sample segmentation. The figure shows an example of our segmentation: the food items on a plate are segmented into different regions, each of which usually corresponds to a single food item. The regions in green correspond to the steak, the regions in blue to the fruit, and the regions in red to the roll. Classifiers are then run on each of these regions.
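
A minimal way to approximate hierarchical segmentation is to run a graph-based segmenter at several scales; whether the paper does exactly this is not stated. A sketch using scikit-image's Felzenszwalb segmentation:

    from skimage.segmentation import felzenszwalb

    def hierarchical_segments(image_rgb, scales=(100, 300, 900)):
        # Graph-based segmentation at several scales: fine scales split
        # touching items, coarse scales merge fragments of one dish.
        return [felzenszwalb(image_rgb, scale=s, sigma=0.8, min_size=200)
                for s in scales]

Regions from every level can then be classified and the most confident label kept, which is one plausible way to reconcile the levels.
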
Figure 5.
Semantic ambiguities in food items. Left: 4 food items are assigned the same label “sandwiches.” Right: 4 visually similar items are assigned different labels.
Figure 6.
Food ontology. An example of a food ontology with an enlarged view of a small section.
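
The abstract does not detail the ontology's structure; a minimal tree of categories like the one the enlarged view suggests might be represented as nested dictionaries (every label below is made up):

    # Illustrative fragment of a food ontology; all labels are invented.
    FOOD_ONTOLOGY = {
        "food": {
            "protein": {"steak": {}, "grilled chicken": {}},
            "grain": {"roll": {}, "rice": {}},
            "fruit": {"apple slices": {}, "grapes": {}},
        }
    }

    def category_path(tree, target, path=()):
        # Path from the root to a label, e.g. ("food", "grain", "roll").
        for name, children in tree.items():
            if name == target:
                return path + (name,)
            found = category_path(children, target, path + (name,))
            if found:
                return found
        return None
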
Figure 7.
Examples of nutrition estimation on the test data set. In each image, a food item and the detected class label below it are shown in the same color.
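
The mapping from portion size to calories is not spelled out in the abstract; one simple area-based scheme, with entirely hypothetical calorie densities, could look like this:

    # Hypothetical calorie densities per cm^2 of plate area; a real system
    # would use a nutrition database keyed by the recognized label.
    KCAL_PER_CM2 = {"steak": 3.5, "roll": 2.1, "fruit": 0.6}

    def estimate_calories(detections, cm2_per_pixel):
        # detections: (label, pixel_area) pairs from the recognizer.
        return sum(KCAL_PER_CM2.get(label, 0.0) * area * cm2_per_pixel
                   for label, area in detections)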
