Multi-basis continuous wavelet transform feature wavelengths selection and machine learning with hyperspectral imaging for non-destructive prediction of 1,2-propylene glycol content in cased tobacco leaves
- PMID: 40792803
- DOI: 10.1364/AO.565221
Multi-basis continuous wavelet transform feature wavelengths selection and machine learning with hyperspectral imaging for non-destructive prediction of 1,2-propylene glycol content in cased tobacco leaves
Abstract
The process of manufacturing tobacco casings constitutes a critical step in the production of tobacco leaves, exerting a substantial influence on the improvement of their physical and chemical characteristics, and consequently, the quality of the final product. Nevertheless, the prevailing approach to casing accuracy detection is predominantly focused on dosage monitoring, with a paucity of attention being paid to real-time effect evaluation. Current hyperspectral-based detection systems encounter difficulties in extracting trace additive features and managing high-dimensional data under limited sample conditions. A multi-basis continuous wavelet transform (CWT) and machine-learning-integrated framework for non-destructive propylene glycol (PG) content prediction were proposed in this paper, addressing precision limitations in tobacco quality monitoring. The preprocessing of hyperspectral imaging data from six PG concentration levels were undertaken via Savitzky-Golay filtering, followed by multiscale decomposition using three CWT basis functions with morlet, Mexican hat, and Gaussian wavelets. A dual optimization mechanism combining correlation threshold filtering and wavelength frequency statistics was developed to enable efficient feature wavelength selection. Furthermore, a stacking regression model was constructed and compared with standalone algorithms. The results demonstrated that the multiscale combined MMG strategy achieved 79.86% dimensionality reduction by selecting 58 feature wavelengths covering adjacent regions in the near-infrared and short-wave infrared (NIR-SWIR) range, significantly enhancing model generalization compared to full-spectrum inputs. Additionally, the stacking regression model attained optimal performance, with a testing set coefficient of determination of 0.9704, under combined MMG input through synergistic complementarity of heterogeneous base learners, with a root mean square error of 0.3188. It is confirmed in this paper that spectral feature interpretability is improved by multi-basis CWT decomposition through complementary wavelet-scale responses, and a novel, to our knowledge, non-destructive PG detection paradigm for industrial tobacco processing is established by the framework. Transferable insights for the hyperspectral analysis of trace components in agricultural products are provided by the methodology, and it could be applied to the non-destructive detection of trace additives for tobacco quality control.
Similar articles
-
Retrieval of nicotine content in cigar leaves by remote analysis of aerial hyperspectral combining machine learning methods.Sci Rep. 2025 Jan 31;15(1):3895. doi: 10.1038/s41598-025-88091-4. Sci Rep. 2025. PMID: 39890995 Free PMC article.
-
Comparative analysis of hyperspectral and near-infrared spectroscopy for bloodstain deposition time estimation.Anal Methods. 2025 Jul 31;17(30):6179-6189. doi: 10.1039/d4ay02167c. Anal Methods. 2025. PMID: 40692444
-
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.Comput Methods Programs Biomed. 2025 Sep;269:108899. doi: 10.1016/j.cmpb.2025.108899. Epub 2025 Jun 21. Comput Methods Programs Biomed. 2025. PMID: 40570739
-
Home treatment for mental health problems: a systematic review.Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150. Health Technol Assess. 2001. PMID: 11532236
-
Hybrid closed-loop systems for managing blood glucose levels in type 1 diabetes: a systematic review and economic modelling.Health Technol Assess. 2024 Dec;28(80):1-190. doi: 10.3310/JYPL3536. Health Technol Assess. 2024. PMID: 39673446 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous