Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 24;23(13):5864.
doi: 10.3390/s23135864.

Application of Machine Learning Algorithms to Classify Peruvian Pisco Varieties Using an Electronic Nose

Affiliations

Application of Machine Learning Algorithms to Classify Peruvian Pisco Varieties Using an Electronic Nose

Celso De-La-Cruz et al. Sensors (Basel). .

Abstract

Pisco is an alcoholic beverage obtained from grape juice distillation. Considered the flagship drink of Peru, it is produced following strict and specific quality standards. In this work, sensing results for volatile compounds in pisco, obtained with an electronic nose, were analyzed through the application of machine learning algorithms for the differentiation of pisco varieties. This differentiation aids in verifying beverage quality, considering the parameters established in its Designation of Origin". For signal processing, neural networks, multiclass support vector machines and random forest machine learning algorithms were implemented in MATLAB. In addition, data augmentation was performed using a proposed procedure based on interpolation-extrapolation. All algorithms trained with augmented data showed an increase in performance and more reliable predictions compared to those trained with raw data. From the comparison of these results, it was found that the best performance was achieved with neural networks.

Keywords: artificial neural network; beverage quality; electronic nose; gas sensors array; random forest; support vector machine.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Electronic nose. (a) Pisco sample, (b) hydraulic system, (c) sensing chamber, (d) temperature controller, (e) LabVIEW software interface.
Figure 2
Figure 2
(a) Platinum electrodes over alumina substrate, (b) gas sensor prepared from a metal oxide (MOx), (c) arrangement of sensors inside the sensing chamber, (d) schematic representation of the sensor.
Figure 3
Figure 3
Interpolation–extrapolation in the feature space for data augmentation. M is the set formed by the points of the blue line.
Figure 4
Figure 4
Interpolation–extrapolation in the time domain for data augmentation.
Figure 5
Figure 5
Example of the rising voltage response of the first group of sensors in one trial. Legend Pi is the i-th class of pisco variety.
Figure 6
Figure 6
Structure of the neural network to classify into 6 classes (varieties and brands).
Figure 7
Figure 7
Confusion matrix obtained after training the neural network without using augmented data.
Figure 8
Figure 8
Confusion matrix obtained after training the neural network using 2000 augmented data.
Figure 9
Figure 9
Results with the first dataset. Accuracy of the prediction of the ANN with the test data after training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue). Ten trainings were generated for each case. The black line shows the mean accuracy for each case.
Figure 10
Figure 10
Results with the first dataset. Accuracy of the MSVM prediction with the test data after training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue).
Figure 11
Figure 11
Results with the first dataset. Accuracy of the RF prediction with the test data after training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue). Ten trainings were generated for each case. The black line shows the mean accuracy for each case.
Figure 12
Figure 12
Example of the rising voltage response of the first group of sensors in one trial. Legend Pi is the i-th class of pisco variety.
Figure 13
Figure 13
Results with the second dataset. Accuracy of the ANN prediction with the test data. Training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue). Ten trainings were generated for each case. The black line shows the mean accuracy.
Figure 14
Figure 14
Results with the second dataset. Accuracy of the MSVM prediction with the test data after training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue).
Figure 15
Figure 15
Results with the second dataset. Accuracy of the RF prediction with the test data. Training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue). Ten trainings were generated for each case. The black line shows the mean accuracy.
Figure 16
Figure 16
Example of the rising voltage response of the third group of sensors in one trial. Legend Pi is the i-th class of pisco variety.
Figure 17
Figure 17
Results with the third dataset. Accuracy of the ANN prediction with the test data. Training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue). Ten trainings were generated for each case. The black line shows the mean accuracy.
Figure 18
Figure 18
Results with the third dataset. Accuracy of the MSVM prediction with the test data after training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue).
Figure 19
Figure 19
Results with the third dataset. Accuracy of the RF prediction with the test data. Training with different amounts of augmented data (0: red; 100: blue; 500: green; 2000: light blue). Ten trainings were generated for each case. The black line shows the mean accuracy.
Figure 20
Figure 20
Results with the third dataset. Accuracy of the ANN prediction with the test data as a function of the variation of the CV parameter. Training with 500 augmented data for each CV value. The red curve indicates the average of the bar and 10 subsequent values.

References

    1. Rossow I., Bye E., Moan I., Kilian C., Bramness J. Changes in Alcohol Consumption during the COVID-19 Pandemic—Small Change in Total Consumption, but Increase in Proportion of Heavy Drinkers. Int. J. Environ. Res. Public Health. 2021;18:4231. doi: 10.3390/ijerph18084231. - DOI - PMC - PubMed
    1. CONCYTEC . IVAI. Destilados Premium, Iniciativas de Vinculación Para Acelerar la Innovación. Consejo Nacional de Ciencia Tecnología e Innovación Tecnológica; Lima, Peru: 2022.
    1. Huertas L. Cronología de la Producción del Vino y del Pisco. Universidad Ricardo Palma; Lima, Peru: 2011.
    1. INDECOPI . Reglamento de la Denominación de Origen Pisco. Instituto Nacional de Defensa de la Competencia y de la Protección de la Propiedad Intelectual; Lima, Peru: 2011.
    1. INACAL . Bebidas Alcohólicas. Pisco. Requisitos. Instituto Nacional de Calidad; Lima, Peru: 2011. NTP 211.001.2006.

LinkOut - more resources