Metabolomic analysis of Yunnan cigar tobacco leaves: impact of geography and climate on flavor characteristics and machine learning-based origin traceability
- PMID: 41789420
- PMCID: PMC12957283
- DOI: 10.3389/fpls.2025.1703429
Metabolomic analysis of Yunnan cigar tobacco leaves: impact of geography and climate on flavor characteristics and machine learning-based origin traceability
Abstract
To investigate how Yunnan's distinctive geographical and climatic conditions shape the unique metabolic profile of its cigar tobacco leaves (CTLs), and to establish a reliable method for origin traceability using machine learning, a non-targeted metabolomics analysis was conducted on 71 CTL samples collected from the Dominican Republic, Indonesia, and Yunnan, including Lincang, Pu'er, and Yuxi within Yunnan. A total of 778 highly reliable metabolites were identified. Influenced by Yunnan's high altitude, large diurnal temperature variation, intense ultraviolet radiation, and relative dryness, its CTLs exhibited characteristic metabolic profiles, with significant enrichment in pathways such as flavone and flavonol biosynthesis and betalain biosynthesis. Elevated levels of polyphenols, indoles, jasmonates, carotenoids, and other compounds were linked to Yunnan CTLs' distinct woody, roasted, and astringent flavor profile. Twelve key biomarkers were selected using Multivariate methods with unbiased variable selection in R (MUVR). Machine learning algorithms-including LDA, LR, GMM, KNN, and SVM-were applied to these biomarkers, achieving highly accurate origin discrimination across national (Yunnan vs. Dominican Republic/Indonesia) and regional (Lincang, Pu'er, Yuxi) scales. Validation results showed a median false classification rate of 0.1 over 100 iterations and an AUC close to 1, confirming the model's high accuracy and robustness for CTLs origin traceability.
Keywords: biomarkers; flavor profile; geographical origin; machine learning; metabolomics.
Copyright © 2026 Wu, Zhao, Li, Li, Wang, Yang, Lin, Yao, Jiao, Zhao, Li, Zhang, Zhao, Zhang and Wang.
Conflict of interest statement
GZ, WW, LY, TZ, JW was employed by company China Tobacco Yunnan Industrial Co., Ltd. The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
References
-
- Abe S. S., Ashida K., Kamil M. I., Karyanto O., Hardjowigeno S., Tawaraya K. (2020). Land use and management effects on volcanic soils in West Sumatra, Indonesia. Geoderma Regional 22, e00308. doi: 10.1016/j.geodrs.2020.e00308 - DOI
-
- Acree T. E., Nishida R., Fukami H. (1985). Odor thresholds of the stereoisomers of methyl jasmonate. J. Agric. Food Chem. 33, 425–427. doi: 10.1021/jf00063a026 - DOI
-
- Ashihara H., Crozier A., Ludwig I. A. (2020). Plant nucleotide metabolism: Biosynthesis, degradation, and alkaloid formation (Hoboken, NJ: John Wiley & Sons Ltd; ).