Int J Mol Sci. 2021 Dec 4;22(23):13124.

doi: 10.3390/ijms222313124.

UMPred-FRL: A New Approach for Accurate Prediction of Umami Peptides Using Feature Representation Learning

Phasit Charoenkwan¹, Chanin Nantasenamat², Md Mehedi Hasan³, Mohammad Ali Moni⁴, Balachandran Manavalan⁵, Watshara Shoombuatong²

Affiliations

¹ Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai 50200, Thailand.
² Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
³ Tulane Center for Biomedical Informatics and Genomics, Division of Biomedical Informatics and Genomics, John W. Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA 70112, USA.
⁴ Artificial Intelligence & Digital Health Data Science, School of Health and Rehabilitation Sciences, Faculty of Health and Behavioural Sciences, The University of Queensland, St Lucia, QLD 4072, Australia.
⁵ Department of Physiology, Ajou University School of Medicine, Suwon 16499, Korea.

PMID: 34884927
PMCID: PMC8658322
DOI: 10.3390/ijms222313124

Phasit Charoenkwan et al. Int J Mol Sci. 2021.

Abstract

Umami ingredients are important in food seasoning and production. Traditional experimental methods for characterizing peptides with umami sensory properties (umami peptides) are time-consuming, laborious, and costly. Computational tools that can screen available sequences at scale for novel umami peptides are therefore desirable. Although one computational tool has been developed for this purpose, its predictive performance is still insufficient. In this study, we use a feature representation learning approach to create a novel machine-learning meta-predictor, UMPred-FRL, for improved umami peptide identification. We combined six well-known machine learning algorithms (extremely randomized trees, k-nearest neighbor, logistic regression, partial least squares, random forest, and support vector machine) with seven feature encodings (amino acid composition, amphiphilic pseudo-amino acid composition, dipeptide composition, pseudo-amino acid composition, and the composition, transition, and distribution descriptors of composition-transition-distribution) to develop the final meta-predictor. Extensive experiments showed that UMPred-FRL was more accurate than its baseline models on the benchmark dataset and consistently outperformed the existing method on the independent test dataset. Finally, to aid the high-throughput identification of umami peptides, the UMPred-FRL web server was established and made freely available online. UMPred-FRL is expected to be a powerful tool for the cost-effective, large-scale screening of candidate peptides with potential umami sensory properties.

Keywords: bioinformatics; feature representation learning; machine learning; sequence analysis; umami peptide.
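The feature representation learning idea summarized in the abstract can be sketched in a few lines: each feature encoding feeds a baseline model, the out-of-fold probabilities from those models form a new "probabilistic feature" vector, and a meta-predictor is trained on that vector. The sketch below is only an illustration under stated assumptions. It uses two of the named encodings (amino acid composition and dipeptide composition), replaces the six learners with a toy nearest-centroid scorer, and runs on made-up peptide sequences with arbitrary labels. All function names are hypothetical, and nothing here reproduces the authors' implementation or data.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]

def aac(seq):
    """Amino acid composition: fraction of each of the 20 residues."""
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]

def dpc(seq):
    """Dipeptide composition: fraction of each of the 400 adjacent pairs."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    total = max(len(pairs), 1)
    return [pairs.count(d) / total for d in DIPEPTIDES]

def centroid(rows):
    return [sum(col) / len(rows) for col in zip(*rows)]

def dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5

def fit_centroids(X, y):
    """Toy stand-in for a baseline learner: one centroid per class.
    Assumes both classes are present in the training rows."""
    pos = centroid([x for x, t in zip(X, y) if t == 1])
    neg = centroid([x for x, t in zip(X, y) if t == 0])
    return pos, neg

def predict_proba(model, x):
    """Score in [0, 1]; nearer the positive centroid gives a higher score."""
    pos, neg = model
    d_pos, d_neg = dist(x, pos), dist(x, neg)
    return 0.5 if d_pos + d_neg == 0 else d_neg / (d_pos + d_neg)

def probabilistic_features(seqs, y, encoders, k=2):
    """Out-of-fold score per (sample, encoder): the new feature representation."""
    n = len(seqs)
    pf = [[0.0] * len(encoders) for _ in range(n)]
    for j, encode in enumerate(encoders):
        X = [encode(s) for s in seqs]
        for fold in range(k):
            train = [i for i in range(n) if i % k != fold]
            held_out = [i for i in range(n) if i % k == fold]
            model = fit_centroids([X[i] for i in train], [y[i] for i in train])
            for i in held_out:
                pf[i][j] = predict_proba(model, X[i])
    return pf

# Made-up toy sequences with arbitrary labels (1 = positive, 0 = negative).
toy_seqs = ["EEDG", "DEEL", "LLVF", "FVLL", "EDEK", "GEED", "VLFI", "ILVF"]
toy_labels = [1, 1, 0, 0, 1, 1, 0, 0]

pf = probabilistic_features(toy_seqs, toy_labels, [aac, dpc])

# Meta-predictor: the same toy learner, now trained on the probabilistic features.
meta_model = fit_centroids(pf, toy_labels)
meta_scores = [predict_proba(meta_model, row) for row in pf]
```

With six learners and seven encodings, the same loop would yield 42 baseline models and a 42-dimensional probabilistic feature vector per peptide, from which an informative subset can be selected before training the meta-predictor. The meta scores above are computed in-sample on toy data and say nothing about real predictive performance.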

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
The overall flowchart of the development of UMPred-FRL. It consists of dataset construction, feature extraction, baseline model construction, new feature representation generation, and a final meta-predictor development.

**Figure 2**
Performance comparison of different baseline models. (A,B) Cross-validation and (C,D) independent test results of 42 baseline models. (A,C) The performance of 42 baseline models in terms of cross-validation and independent test ACC. (B,D) The average performance of each classifier over seven different feature descriptors on the training and independent test datasets, respectively.

**Figure 3**
Performance evaluations of top 30 baseline models. (A,B) Cross-validation BACC and MCC of top 30 baseline models. (C,D) Independent test BACC and MCC of top 30 baseline models.

**Figure 4**
t-distributed stochastic neighbor embedding (t-SNE) distribution of the positive and negative samples on the training (A–C) and independent test (D–F) datasets, respectively. (A,D) AAC, (B,E) CTDC and (C,F) optimal PF.

**Figure 5**
SHAP values of the 7 informative probabilistic features used for UMPred-FRL. SHAP values indicate the direction of each feature's contribution: positive values correspond to positive (umami peptide) predictions and negative values to negative (non-umami peptide) predictions.

**Figure 6**
Performance comparison of UMPred-FRL with the top five baseline models on the training (A,B) and independent test (C,D) datasets. (A,B) Prediction results of UMPred-FRL and the top five baseline models in terms of ACC, BACC, Sn, Sp, and MCC. (C,D) ROC curves and AUC values of the top five baseline models.

**Figure 7**
Performance of the proposed UMPred-FRL and the existing method (iUmami-SCM) on training (A,B) and independent test (C,D) datasets. (A,B) Prediction results of UMPred-FRL and iUmami-SCM in terms of ACC, BACC, Sn, Sp and MCC. (C,D) ROC curves and AUC values of UMPred-FRL and iUmami-SCM.
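The captions above report ACC, BACC, Sn, Sp, and MCC. As a reminder of how these metrics are conventionally defined from a binary confusion matrix, here is a minimal sketch. The function name `binary_metrics` and the counts in the usage line are hypothetical and are not results from the paper.

```python
from math import sqrt

def binary_metrics(tp, fn, tn, fp):
    """Standard binary classification metrics from confusion-matrix counts.
    Assumes at least one positive and one negative sample are present."""
    sn = tp / (tp + fn)                      # sensitivity (recall on positives)
    sp = tn / (tn + fp)                      # specificity (recall on negatives)
    acc = (tp + tn) / (tp + fn + tn + fp)    # overall accuracy
    bacc = (sn + sp) / 2                     # balanced accuracy
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention used here: MCC is reported as 0.0 when the denominator is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"ACC": acc, "BACC": bacc, "Sn": sn, "Sp": sp, "MCC": mcc}

# Hypothetical counts, for illustration only.
example = binary_metrics(tp=8, fn=2, tn=15, fp=5)
```

BACC and MCC are useful alongside ACC when the positive and negative classes differ in size, because ACC alone can look high for a model that favors the majority class.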

Cited by

Toward a general and interpretable umami taste predictor using a multi-objective machine learning approach.
Pallante L, Korfiati A, Androutsos L, Stojceski F, Bompotas A, Giannikos I, Raftopoulos C, Malavolta M, Grasso G, Mavroudi S, Kalogeras A, Martos V, Amoroso D, Piga D, Theofilatos K, Deriu MA. Sci Rep. 2022 Dec 16;12(1):21735. doi: 10.1038/s41598-022-25935-3. PMID: 36526644. Free PMC article.
Predicting multiple taste sensations with a multiobjective machine learning method.
Androutsos L, Pallante L, Bompotas A, Stojceski F, Grasso G, Piga D, Di Benedetto G, Alexakos C, Kalogeras A, Theofilatos K, Deriu MA, Mavroudi S. NPJ Sci Food. 2024 Jul 25;8(1):47. doi: 10.1038/s41538-024-00287-6. PMID: 39054312. Free PMC article.
TROLLOPE: A novel sequence-based stacked approach for the accelerated discovery of linear T-cell epitopes of hepatitis C virus.
Charoenkwan P, Waramit S, Chumnanpuen P, Schaduangrat N, Shoombuatong W. PLoS One. 2023 Aug 25;18(8):e0290538. doi: 10.1371/journal.pone.0290538. eCollection 2023. PMID: 37624802. Free PMC article.
StackTTCA: a stacking ensemble learning-based framework for accurate and high-throughput identification of tumor T cell antigens.
Charoenkwan P, Schaduangrat N, Shoombuatong W. BMC Bioinformatics. 2023 Jul 28;24(1):301. doi: 10.1186/s12859-023-05421-x. PMID: 37507654. Free PMC article.
Rapid screening and taste mechanism of novel umami peptides from natural tripeptide database.
Lan J, Xiong Y, Dang K, Pan D, Du L, Wang Y, Dang Y. Food Chem X. 2025 May 23;28:102565. doi: 10.1016/j.fochx.2025.102565. eCollection 2025 May. PMID: 40520701. Free PMC article.

Grants and funding

2021R1A2C1014338/National Research Foundation of Korea
