Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 14;12(2):491.
doi: 10.3390/diagnostics12020491.

Modular Point-of-Care Breath Analyzer and Shape Taxonomy-Based Machine Learning for Gastric Cancer Detection

Affiliations

Modular Point-of-Care Breath Analyzer and Shape Taxonomy-Based Machine Learning for Gastric Cancer Detection

Inese Polaka et al. Diagnostics (Basel). .

Abstract

Background: Gastric cancer is one of the deadliest malignant diseases, and the non-invasive screening and diagnostics options for it are limited. In this article, we present a multi-modular device for breath analysis coupled with a machine learning approach for the detection of cancer-specific breath from the shapes of sensor response curves (taxonomies of clusters).

Methods: We analyzed the breaths of 54 gastric cancer patients and 85 control group participants. The analysis was carried out using a breath analyzer with gold nanoparticle and metal oxide sensors. The response of the sensors was analyzed on the basis of the curve shapes and other features commonly used for comparison. These features were then used to train machine learning models using Naïve Bayes classifiers, Support Vector Machines and Random Forests.

Results: The accuracy of the trained models reached 77.8% (sensitivity: up to 66.54%; specificity: up to 92.39%). The use of the proposed shape-based features improved the accuracy in most cases, especially the overall accuracy and sensitivity.

Conclusions: The results show that this point-of-care breath analyzer and data analysis approach constitute a promising combination for the detection of gastric cancer-specific breath. The cluster taxonomy-based sensor reaction curve representation improved the results, and could be used in other similar applications.

Keywords: breath analysis; electronic nose; gastric cancer; machine learning; screening.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The point-of-care device used in the study: (a) the design of the device, with a disposable mouthpiece inserted at the front; (b) the main blocks of the system.
Figure 2
Figure 2
Diagram of the data analysis process.
Figure 3
Figure 3
An example of the data after the preprocessing: (a) common features describing the curve; (b) curves of a GNP sensor; (c) curves of an analogue MOX sensor; (d) curves of a digital MOX sensor.
Figure 3
Figure 3
An example of the data after the preprocessing: (a) common features describing the curve; (b) curves of a GNP sensor; (c) curves of an analogue MOX sensor; (d) curves of a digital MOX sensor.
Figure 4
Figure 4
Cluster taxonomy of the responses from one gold nanoparticle sensor.
Figure 5
Figure 5
The overall accuracy of Naïve Bayes classifiers: mean values and 95% confidence intervals.
Figure 6
Figure 6
Sensitivity of Naïve Bayes classifiers: mean values and 95% confidence intervals.
Figure 7
Figure 7
Specificity of Naïve Bayes classifiers: mean values and 95% confidence intervals.
Figure 8
Figure 8
The area under the ROC curve of Naïve Bayes classifiers: mean values and 95% confidence intervals.
Figure 9
Figure 9
An example of the characteristic shapes used in a Naïve Bayes model: a taxonomy for GNP sensor responses cut at six clusters (a), taxonomies of two other GNP sensors cut at 10 and four clusters (b,c), and one MOXD sensor at five clusters (d); the dashed lines shows individual measurements, and the solid bold lines show the cluster-characteristic shapes.
Figure 10
Figure 10
The overall accuracy of Random Forests: mean values and 95% confidence intervals.
Figure 11
Figure 11
The overall accuracy of SVMs: mean values and 95% confidence intervals.

References

    1. Tan Y.K., Fielding J.W.L. Early diagnosis of early gastric cancer. Eur. J. Gastroenterol. Hepatol. 2006;18:821–829. doi: 10.1097/00042737-200608000-00004. - DOI - PubMed
    1. Daniel D.A.P., Thangavel K. Breathomics for gastric cancer classification using back-propagation neural network. J. Med. Signals Sens. 2016;6:172–182. doi: 10.4103/2228-7477.186879. - DOI - PMC - PubMed
    1. Amor R.E., Nakhleh M.K., Barash O., Haick H. Breath analysis of cancer in the present and the future. Eur. Respir. Rev. 2019;28:190002. doi: 10.1183/16000617.0002-2019. - DOI - PMC - PubMed
    1. Yang H.-Y., Chen W.-C., Tsai R.-C. Accuracy of the Electronic Nose Breath Tests in Clinical Application: A Systematic Review and Meta-Analysis. Biosensors. 2021;11:469. doi: 10.3390/bios11110469. - DOI - PMC - PubMed
    1. Peng G., Hakim M., Broza Y.Y., Billan S., Abdah-Bortnyak R., Kuten A., Tisch U., Haick H. Detection of lung, breast, colorectal, and prostate cancers from exhaled breath using a single array of nanosensors. Br. J. Cancer. 2010;103:542–551. doi: 10.1038/sj.bjc.6605810. - DOI - PMC - PubMed

LinkOut - more resources