Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 4;9(1):121.
doi: 10.1038/s41538-025-00496-7.

Applying machine learning to classify table olives using bacterial metataxonomic data

Affiliations

Applying machine learning to classify table olives using bacterial metataxonomic data

Elio López-García et al. NPJ Sci Food. .

Abstract

In recent years, metataxonomic analysis has been increasingly used to characterize microbial communities in fermented foods. Moreover, advances in bioinformatics and machine learning (ML) have expanded resources for analyzing these metataxonomic data. Particularly tree-based algorithms are valuable for their interpretability. This work compares the use of three tree-based ML algorithms-Classification and Regression Tree, Random Forest (RF), and Extreme Gradient Boosting- for the analysis of a database composed of 442 samples of 16S rRNA bacterial profiles obtained from table olives. Our findings show that ML techniques can effectively classify bacterial profiles based on olive processing type, cultivar, country of origin, and isolation matrix. The RF model achieved the highest accuracy, reaching 97% in the best cases, with a kappa coefficient above 0.8 for most categories. This approach holds potential applications in the table olive sector and in other food products, where the industrial application of ML techniques could enhance traceability, authenticity, and quality control.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Application of AI to olive bacterial metataxonomic data.
Results of the 3 ML models CART, RF, and XGB for data classification according to olive processing type [A Prediction accuracy results and B Kappa coefficient results], olive cultivar [C Prediction accuracy results and D Kappa coefficient results], country of origin [E Prediction accuracy results and F Kappa coefficient results], and isolation matrix [G Prediction accuracy results and H Kappa coefficient results].
Fig. 2
Fig. 2. Ranking of discriminant bacterial genera.
Top 20 most important bacterial genera deduced from the RF model for data classification according to their A type of olive processing, B olive cultivar, C country of origin, D isolation matrix.
Fig. 3
Fig. 3
Workflow diagram used in the present study including data acquisition, preprocessing, building supervised ML models, and finally validation with test data.

Similar articles

References

    1. Medina, E., Brenes, M., García-García, P., Romero, C. & De Castro, A. Microbial ecology along the processing of Spanish olives darkened by oxidation. Food Control86, 35–41 (2018).
    1. Benítez-Cabello, A., Romero-Gil, V., Medina-Pradas, E., Garrido-Fernández, A. & Arroyo-López, F. N. Exploring bacteria diversity in commercialized table olive biofilms by metataxonomic and compositional data analysis. Sci. Rep.10, 1–13 (2020). - PMC - PubMed
    1. Kazou, M., Tzamourani, A., Panagou, E. Z. & Tsakalidou, E. Unraveling the microbiota of natural black cv. Kalamata fermented olives through 16S and ITS metataxonomic analysis. Microorganisms8, 10.3390/microorganisms8050672 (2020). - PMC - PubMed
    1. López-García, E. et al. Bacterial metataxonomic analysis of industrial Spanish-style green table olive fermentations. Food Control137, 10.1016/j.foodcont.2022.108969 (2022).
    1. Hurtado, A., Reguant, C., Bordons, A. & Rozès, N. Lactic acid bacteria from fermented table olives. Food Microbiol.31, 1–8 (2012). - PubMed

LinkOut - more resources