A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation

doi:10.2196/28229

. 2021 Aug 9;23(8):e28229.

doi: 10.2196/28229.

A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation

Riste Stojanov^#¹, Gorjan Popovski^#^{2

3}, Gjorgjina Cenikj^#^{2

3}, Barbara Koroušić Seljak^#², Tome Eftimov^#²

Affiliations

¹ Faculty of Computer Science and Engineering, Ss Cyril and Methodius, University- Skopje, Skopje, the Former Yugoslav Republic of Macedonia.
² Computer Systems Department, Jožef Stefan Institute, Ljubljana, Slovenia.
³ Jožef Stefan International Postgraduate School, Ljubljana, Slovenia.

^# Contributed equally.

PMID: 34383671
PMCID: PMC8415558
DOI: 10.2196/28229

A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation

Riste Stojanov et al. J Med Internet Res. 2021.

. 2021 Aug 9;23(8):e28229.

doi: 10.2196/28229.

Authors

Riste Stojanov^#¹, Gorjan Popovski^#^{2

3}, Gjorgjina Cenikj^#^{2

3}, Barbara Koroušić Seljak^#², Tome Eftimov^#²

Affiliations

¹ Faculty of Computer Science and Engineering, Ss Cyril and Methodius, University- Skopje, Skopje, the Former Yugoslav Republic of Macedonia.
² Computer Systems Department, Jožef Stefan Institute, Ljubljana, Slovenia.
³ Jožef Stefan International Postgraduate School, Ljubljana, Slovenia.

^# Contributed equally.

PMID: 34383671
PMCID: PMC8415558
DOI: 10.2196/28229

Abstract

Background: Recently, food science has been garnering a lot of attention. There are many open research questions on food interactions, as one of the main environmental factors, with other health-related entities such as diseases, treatments, and drugs. In the last 2 decades, a large amount of work has been done in natural language processing and machine learning to enable biomedical information extraction. However, machine learning in food science domains remains inadequately resourced, which brings to attention the problem of developing methods for food information extraction. There are only few food semantic resources and few rule-based methods for food information extraction, which often depend on some external resources. However, an annotated corpus with food entities along with their normalization was published in 2019 by using several food semantic resources.

Objective: In this study, we investigated how the recently published bidirectional encoder representations from transformers (BERT) model, which provides state-of-the-art results in information extraction, can be fine-tuned for food information extraction.

Methods: We introduce FoodNER, which is a collection of corpus-based food named-entity recognition methods. It consists of 15 different models obtained by fine-tuning 3 pretrained BERT models on 5 groups of semantic resources: food versus nonfood entity, 2 subsets of Hansard food semantic tags, FoodOn semantic tags, and Systematized Nomenclature of Medicine Clinical Terms food semantic tags.

Results: All BERT models provided very promising results with 93.30% to 94.31% macro F1 scores in the task of distinguishing food versus nonfood entity, which represents the new state-of-the-art technology in food information extraction. Considering the tasks where semantic tags are predicted, all BERT models obtained very promising results once again, with their macro F1 scores ranging from 73.39% to 78.96%.

Conclusions: FoodNER can be used to extract and annotate food entities in 5 different tasks: food versus nonfood entities and distinguishing food entities on the level of food groups by using the closest Hansard semantic tags, the parent Hansard semantic tags, the FoodOn semantic tags, or the Systematized Nomenclature of Medicine Clinical Terms semantic tags.

Keywords: BERT; bidirectional encoder representations from transformers; fine-tuning BERT; food information extraction; information extraction; machine learning; named-entity recognition; natural language processing; semantic annotation.

©Riste Stojanov, Gorjan Popovski, Gjorgjina Cenikj, Barbara Koroušić Seljak, Tome Eftimov. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 09.08.2021.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 2**
Food named-entity recognition flowchart. BERT: bidirectional encoder representations from transformers; NER: named-entity recognition; SNOMED CT: Systematized Nomenclature of Medicine Clinical Terms.

**Figure 3**
An example of food entities available from one recipe that are present in the training data set. The entities are annotated using Hansard parent, Hansard closest, FoodOn, Systematized Nomenclature of Medicine Clinical Terms, and OntoFood (not studied in this paper) semantic tags.

**Figure 4**
Training and validation loss per fine-tuning epoch for the bio bidirectional encoder representations from transformers large model on the Hansard parent data set.

**Figure 5**
Macro F1 scores for all considered models for the food versus nonfood entity task. Each macro F1 score is obtained by using stratified k-fold cross-validation (k=5). Underlined values are best per subtable, while the bold value is the best from the whole table. BERT: bidirectional encoder representations from transformers; BiLSTM-CRF: bidirectional long short-term memory conditional random field; BuTTER: bidirectional long short-term memory for food named-entity recognition; NER: named-entity recognition.

**Figure 6**
Boxplots of macro F1 scores obtained by using stratified five-fold cross-validation for all considered models for the binary food classification task. BERT: bidirectional encoder representations from transformers; BiLSTM-CRF: bidirectional long short-term memory conditional random field.

**Figure 7**
Food named-entity recognition integration in FoodViz.

See this image and copyright information in PMC

Cited by

Integrating machine learning and artificial intelligence in life-course epidemiology: pathways to innovative public health solutions.
Chen S, Yu J, Chamouni S, Wang Y, Li Y. Chen S, et al. BMC Med. 2024 Sep 2;22(1):354. doi: 10.1186/s12916-024-03566-x. BMC Med. 2024. PMID: 39218895 Free PMC article. Review.
Zero-shot evaluation of ChatGPT for food named-entity recognition and linking.
Ogrinc M, Koroušić Seljak B, Eftimov T. Ogrinc M, et al. Front Nutr. 2024 Aug 13;11:1429259. doi: 10.3389/fnut.2024.1429259. eCollection 2024. Front Nutr. 2024. PMID: 39290564 Free PMC article.
From language models to large-scale food and biomedical knowledge graphs.
Cenikj G, Strojnik L, Angelski R, Ogrinc N, Koroušić Seljak B, Eftimov T. Cenikj G, et al. Sci Rep. 2023 May 15;13(1):7815. doi: 10.1038/s41598-023-34981-4. Sci Rep. 2023. PMID: 37188766 Free PMC article.
Decoding the Foodome: Molecular Networks Connecting Diet and Health.
Menichetti G, Barabási AL, Loscalzo J. Menichetti G, et al. Annu Rev Nutr. 2024 Aug;44(1):257-288. doi: 10.1146/annurev-nutr-062322-030557. Annu Rev Nutr. 2024. PMID: 39207880 Free PMC article. Review.
CafeteriaFCD Corpus: Food Consumption Data Annotated with Regard to Different Food Semantic Resources.
Ispirova G, Cenikj G, Ogrinc M, Valenčič E, Stojanov R, Korošec P, Cavalli E, Koroušić Seljak B, Eftimov T. Ispirova G, et al. Foods. 2022 Sep 2;11(17):2684. doi: 10.3390/foods11172684. Foods. 2022. PMID: 36076868 Free PMC article.

See all "Cited by" articles

References

1. Johan F, Owen G. Scaling 36 solutions to halve emissions by 2030. Exponential Roadmap. 2020. [2021-05-19]. https://exponentialroadmap.org/wp-content/uploads/2019/09/Exponential-Ro... .
1. Qiao L, Yang L, Hong D, Yao L, Zhiguang Q. Knowledge graph construction techniques. Journal of computer research and development. 2016;53(3):582. doi: 10.7544/issn1000-1239.2016.20148228. https://crad.ict.ac.cn/EN/10.7544/issn1000-1239.2016.20148228 - DOI - DOI
1. Zhou X, Zhang X, Hu X. MaxMatcher: Biological concept extraction using approximate dictionary lookup. Pacific Rim International Conference On Artificial Intelligence; August 7-11, 2006; Guilin, China. 2006. pp. 1145–1149. - DOI
1. Hanisch D, Fundel K, Mevissen H, Zimmer R, Fluck J. ProMiner: rule-based protein and gene entity recognition. BMC Bioinformatics. 2005;6(Suppl 1):S14. doi: 10.1186/1471-2105-6-s1-s14. - DOI - PMC - PubMed
1. Eftimov T, Koroušić Seljak Barbara, Korošec Peter. A rule-based named-entity recognition method for knowledge extraction of evidence-based dietary recommendations. PLoS One. 2017;12(6):e0179488. doi: 10.1371/journal.pone.0179488. https://dx.plos.org/10.1371/journal.pone.0179488 PONE-D-16-46189 - DOI - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

[1] Johan F, Owen G. Scaling 36 solutions to halve emissions by 2030. Exponential Roadmap. 2020. [2021-05-19]. https://exponentialroadmap.org/wp-content/uploads/2019/09/Exponential-Ro... .

[2] Johan F, Owen G. Scaling 36 solutions to halve emissions by 2030. Exponential Roadmap. 2020. [2021-05-19]. https://exponentialroadmap.org/wp-content/uploads/2019/09/Exponential-Ro... .

[3] Qiao L, Yang L, Hong D, Yao L, Zhiguang Q. Knowledge graph construction techniques. Journal of computer research and development. 2016;53(3):582. doi: 10.7544/issn1000-1239.2016.20148228. https://crad.ict.ac.cn/EN/10.7544/issn1000-1239.2016.20148228 - DOI - DOI

[4] Qiao L, Yang L, Hong D, Yao L, Zhiguang Q. Knowledge graph construction techniques. Journal of computer research and development. 2016;53(3):582. doi: 10.7544/issn1000-1239.2016.20148228. https://crad.ict.ac.cn/EN/10.7544/issn1000-1239.2016.20148228 - DOI - DOI

[5] Zhou X, Zhang X, Hu X. MaxMatcher: Biological concept extraction using approximate dictionary lookup. Pacific Rim International Conference On Artificial Intelligence; August 7-11, 2006; Guilin, China. 2006. pp. 1145–1149. - DOI

[6] Zhou X, Zhang X, Hu X. MaxMatcher: Biological concept extraction using approximate dictionary lookup. Pacific Rim International Conference On Artificial Intelligence; August 7-11, 2006; Guilin, China. 2006. pp. 1145–1149. - DOI

[7] Hanisch D, Fundel K, Mevissen H, Zimmer R, Fluck J. ProMiner: rule-based protein and gene entity recognition. BMC Bioinformatics. 2005;6(Suppl 1):S14. doi: 10.1186/1471-2105-6-s1-s14. - DOI - PMC - PubMed

[8] Hanisch D, Fundel K, Mevissen H, Zimmer R, Fluck J. ProMiner: rule-based protein and gene entity recognition. BMC Bioinformatics. 2005;6(Suppl 1):S14. doi: 10.1186/1471-2105-6-s1-s14. - DOI - PMC - PubMed

[9] Eftimov T, Koroušić Seljak Barbara, Korošec Peter. A rule-based named-entity recognition method for knowledge extraction of evidence-based dietary recommendations. PLoS One. 2017;12(6):e0179488. doi: 10.1371/journal.pone.0179488. https://dx.plos.org/10.1371/journal.pone.0179488 PONE-D-16-46189 - DOI - DOI - PMC - PubMed

[10] Eftimov T, Koroušić Seljak Barbara, Korošec Peter. A rule-based named-entity recognition method for knowledge extraction of evidence-based dietary recommendations. PLoS One. 2017;12(6):e0179488. doi: 10.1371/journal.pone.0179488. https://dx.plos.org/10.1371/journal.pone.0179488 PONE-D-16-46189 - DOI - DOI - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation

Affiliations

A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources