Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 1:8:100729.
doi: 10.1016/j.crfs.2024.100729. eCollection 2024.

Learning algorithms for identification of whisky using portable Raman spectroscopy

Affiliations

Learning algorithms for identification of whisky using portable Raman spectroscopy

Kwang Jun Lee et al. Curr Res Food Sci. .

Abstract

Reliable identification of high-value products such as whisky is vital due to rising issues of brand substitution and quality control in the industry. We have developed a novel framework that can perform whisky analysis directly from raw spectral data with no human intervention by integrating machine learning models with a portable Raman device. We demonstrate that machine learning models can achieve over 99% accuracy in brand or product identification across twenty-eight commercial samples. To demonstrate the flexibility of this approach, we utilized the same algorithms to quantify ethanol concentrations, as well as measuring methanol levels in spiked whisky samples. To demonstrate the potential use of these algorithms in a real-world environment we tested our algorithms on spectral measurements performed through the original whisky bottle. Through the bottle measurements are facilitated by a beam geometry hitherto not applied to whisky brand identification in conjunction with machine learning. Removing the need for decanting greatly enhances the practicality and commercial potential of this technique, enabling its use in detecting counterfeit or adulterated spirits and other high-value liquids. The techniques established in this paper aim to function as a rapid and non-destructive initial screening mechanism for detecting falsified and tampered spirits, complementing more comprehensive and stringent analytical methods.

Keywords: Brand identification; Machine learning; Raman spectroscopy; Whisky.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image 1
Graphical abstract
Fig. 1
Fig. 1
Brand identification: (a) Deep learning model accuracies per epoch on the training set. (b) Accuracy of deep learning model on the test set and training time. The blue and red dots represent the test accuracy and training time results, respectively. Each dot represents an increase in accuracy and training time as the epochs increase by 100, 200, 500, 1000, and 2000. Shaded grey areas show where the model has > 96% accuracy. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Fig. 2
Fig. 2
Ethanol content prediction: (a) Using PCA + HPM with 2000 epochs (RP2 = 0.998 and RMSEP = 0.25%). (b) Using PCA + FCN with 200 epochs applied to a dataset with previously unseen whisky brands (red dot) and gin samples (green x) (RU2 = 0.863 and RMSEU = 2.47% for the test set). Methanol content prediction: (c) Using PCA + HPM with 1000 epochs. The spectral data of Talisker, Cragganmore, 40% ethanol/water, Caol Ila, and Cynelish spiked with methanol were divided into training (60%), validation (20%), and test (20%) sets. (d) Using PCA + HPM with 1000 epochs. The spectral data of Talisker, Cragganmore, and 40% ethanol/water spiked with methanol were split into training (80%) and validation (20%) sets. The spectra of Caol Ila and Cynelish spiked with methanol were only used as a test set. The blue and red dots represent the results of the training and test sets, respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Fig. 3
Fig. 3
Brand identification accuracy of conventional machine learning models. The blue and red dots represent the results of the training and test sets, respectively, with PCA. The dashed lines show the respective accuracy levels without PCA. The red and blue dashed lines overlap each other on the LDA and ANN graphs due to the identical accuracy they achieved on both the training and test sets. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Fig. 4
Fig. 4
Methanol content prediction using a PLSR model (PLSR7). The spectral data of Talisker, Cragganmore, and 40% ethanol/water spiked with methanol were used as a training set. The spectra of Caol Ila and Cynelish spiked with methanol were only used as a test set. The blue and red dots represent the prediction results on the training and unseen test sets, respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)

Similar articles

References

    1. Ashok Praveen C., Praveen Bavishna B., Dholakia K. Near infrared spectroscopic analysis of single malt Scotch whisky on an optofluidic chip. 2011;19(23):22982–22992. - PubMed
    1. Ashok Praveen C., Praveen Bavishna B., Dholakia Kishan. Optofluidic Raman sensor for simultaneous detection of the toxicity and quality of alcoholic beverages. J. Raman Spectrosc. 2013;44(6):795–797.
    1. Bzdok D., Altman N., Krzywinski M. Statistics versus machine learning. Nat. Methods. 2018;15(4):233–234. - PMC - PubMed
    1. Chaudhry P.E., Zimmerman A., Peters J.R., Cordell V.V. Preserving intellectual property rights: managerial insight into the escalating counterfeit market quandary. Bus. Horiz. 2009;52(1):57–66.
    1. Chollet F. Simon and Schuster; 2017. Deep Learning with Python.

LinkOut - more resources