Applying machine learning to classify table olives using bacterial metataxonomic data
- PMID: 40615468
- PMCID: PMC12227649
- DOI: 10.1038/s41538-025-00496-7
Applying machine learning to classify table olives using bacterial metataxonomic data
Abstract
In recent years, metataxonomic analysis has been increasingly used to characterize microbial communities in fermented foods. Moreover, advances in bioinformatics and machine learning (ML) have expanded resources for analyzing these metataxonomic data. Particularly tree-based algorithms are valuable for their interpretability. This work compares the use of three tree-based ML algorithms-Classification and Regression Tree, Random Forest (RF), and Extreme Gradient Boosting- for the analysis of a database composed of 442 samples of 16S rRNA bacterial profiles obtained from table olives. Our findings show that ML techniques can effectively classify bacterial profiles based on olive processing type, cultivar, country of origin, and isolation matrix. The RF model achieved the highest accuracy, reaching 97% in the best cases, with a kappa coefficient above 0.8 for most categories. This approach holds potential applications in the table olive sector and in other food products, where the industrial application of ML techniques could enhance traceability, authenticity, and quality control.
© 2025. The Author(s).
Conflict of interest statement
Competing interests: The authors declare no competing interests.
Figures



Similar articles
-
Weed Detection Using Deep Learning: A Systematic Literature Review.Sensors (Basel). 2023 Mar 31;23(7):3670. doi: 10.3390/s23073670. Sensors (Basel). 2023. PMID: 37050730 Free PMC article.
-
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142. Br J Dermatol. 2024. PMID: 38581445
-
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340. Health Technol Assess. 2006. PMID: 16959170
-
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x. Respir Res. 2024. PMID: 39709425 Free PMC article. Review.
-
The Use of Machine Learning for Analyzing Real-World Data in Disease Prediction and Management: Systematic Review.JMIR Med Inform. 2025 Jun 19;13:e68898. doi: 10.2196/68898. JMIR Med Inform. 2025. PMID: 40537090 Free PMC article.
References
-
- Medina, E., Brenes, M., García-García, P., Romero, C. & De Castro, A. Microbial ecology along the processing of Spanish olives darkened by oxidation. Food Control86, 35–41 (2018).
-
- López-García, E. et al. Bacterial metataxonomic analysis of industrial Spanish-style green table olive fermentations. Food Control137, 10.1016/j.foodcont.2022.108969 (2022).
-
- Hurtado, A., Reguant, C., Bordons, A. & Rozès, N. Lactic acid bacteria from fermented table olives. Food Microbiol.31, 1–8 (2012). - PubMed
Grants and funding
- MMT24-IG-01/European Commission - NextGenerationEU, through Momentum CSIC Programme: Develop Your Digital Talent
- MMT24-IG-01/European Commission - NextGenerationEU, through Momentum CSIC Programme: Develop Your Digital Talent
- MMT24-IG-01/European Commission - NextGenerationEU, through Momentum CSIC Programme: Develop Your Digital Talent
LinkOut - more resources
Full Text Sources