Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Feb 11;10(1):017106.
doi: 10.1088/1752-7155/10/1/017106.

The use of a gas chromatography-sensor system combined with advanced statistical methods, towards the diagnosis of urological malignancies

Affiliations

The use of a gas chromatography-sensor system combined with advanced statistical methods, towards the diagnosis of urological malignancies

Raphael B M Aggio et al. J Breath Res. .

Abstract

Prostate cancer is one of the most common cancers. Serum prostate-specific antigen (PSA) is used to aid the selection of men undergoing biopsies. Its use remains controversial. We propose a GC-sensor algorithm system for classifying urine samples from patients with urological symptoms. This pilot study includes 155 men presenting to urology clinics, 58 were diagnosed with prostate cancer, 24 with bladder cancer and 73 with haematuria and or poor stream, without cancer. Principal component analysis (PCA) was applied to assess the discrimination achieved, while linear discriminant analysis (LDA) and support vector machine (SVM) were used as statistical models for sample classification. Leave-one-out cross-validation (LOOCV), repeated 10-fold cross-validation (10FoldCV), repeated double cross-validation (DoubleCV) and Monte Carlo permutations were applied to assess performance. Significant separation was found between prostate cancer and control samples, bladder cancer and controls and between bladder and prostate cancer samples. For prostate cancer diagnosis, the GC/SVM system classified samples with 95% sensitivity and 96% specificity after LOOCV. For bladder cancer diagnosis, the SVM reported 96% sensitivity and 100% specificity after LOOCV, while the DoubleCV reported 87% sensitivity and 99% specificity, with SVM showing 78% and 98% sensitivity between prostate and bladder cancer samples. Evaluation of the results of the Monte Carlo permutation of class labels obtained chance-like accuracy values around 50% suggesting the observed results for bladder cancer and prostate cancer detection are not due to over fitting. The results of the pilot study presented here indicate that the GC system is able to successfully identify patterns that allow classification of urine samples from patients with urological cancers. An accurate diagnosis based on urine samples would reduce the number of negative prostate biopsies performed, and the frequency of surveillance cystoscopy for bladder cancer patients. Larger cohort studies are planned to investigate the potential of this system. Future work may lead to non-invasive breath analyses for diagnosing urological conditions.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Pipeline for data analysis.
Figure 2
Figure 2
Chromatograms showing baseline correction using a 2 step procedure
Figure 3
Figure 3
Schematic for selecting the reference sample for chromatogram alignment.
Figure 4
Figure 4
Validation schemes used to validate the classifiers built by the pipeline presented here: (a) repeated 10-fold cross-validation; (b) repeated double cross-validation; (c) Repeated 10-fold cross-validation Monte Carlo; and (d) repeated double cross-validation Monte Carlo.
Figure 5
Figure 5
Boxplot (a) and two-factor principal component analysis (b) of data derived from patients with prostate cancer and controls
Figure 6
Figure 6
ROC curves of repeated double cross-validation applied to Prostate and Control urine sample data processed by the GC-sensor pipeline and modelled by linear discriminant analysis (LDA) and support vector machine with polynomial kernel (SVM-P).
Figure 7
Figure 7
Sensitivity and specificity data of biomarkers for prostate cancer. Grey rows = biomarkers found in the literature; Orange rows = GC-sensor pipeline; VOC = volatile organic compound; Validation = the statistical method used to validate sample classification; LOOCV = leave-one-out cross-validation; 10-FoldCV = 30 times repeated 10-fold cross-validation; DoubleCV = 30 time repeated double cross-validation (outer loop = 3 folds; inner loop = 10 fold cross-validation repeated 5 times); PSA = prostate-specific antigen; p2PSA = serum isoform [-2] proPSA; PHI = prostate healthy index; imPSA = intracellular macrophage PSA; PCA3 = prostate cancer antigen 3; Not validated = no statistical method was applied to validate results of sample classification.
Figure 8
Figure 8
Boxplot (a) and two-factor principal component analysis (b) of data derived from patients with bladder cancer and controls
Figure 9
Figure 9
ROC curves of repeated double cross-validation applied to Bladder and Control urine sample data processed by the GC-sensor pipeline and modelled by linear discriminant analysis (LDA) and support vector machine with polynomial kernel (SVM-P).
Figure 10
Figure 10
Sensitivity and specificity data of biomarkers for bladder cancer. Grey rows = biomarkers found in the literature; Orange rows = framework presented here; BTA = bladder tumor antigen; NMP-22 = nuclear matrix protein 22; Hb dipstick = hemoglobin dipstick; Cxbladder = Cxbladder Detect; VOC = volatile organic compound; Validation = the statistical method used to validate sample classification; LOOCV = leave-one-out cross-validation; 10-FoldCV= 30 times repeated 10-fold cross-validation; DoubleCV = 30 time repeats of 3-fold double cross-validation with an inner loop of 10-fold cross-validation repeated 5 times; Not specified = a statistical validation method for sample classification was not found in source.

References

    1. [Accessed June 6 2015];Cancer Research UK Web site. http://www.cancerresearchuk.org/cancer-info/cancerstats/types/bladder/in...
    1. Garg V, Gu NY, Borrego ME, Raisch DW. A literature review of cost-effectiveness analyses of prostate-specific antigen test in prostate cancer screening. Expert Review of Pharmacoeconomics & Outcomes Research. 2013;13(3):327–342. - PubMed
    1. Bradley LA, Palomaki GE, Gutman S, Samson D, Aronson N. Comparative Effectiveness Review: Prostate Cancer Antigen 3 Testing for the Diagnosis and Management of Prostate Cancer. Journal of Urology. 2013;190(2):389–398. - PubMed
    1. Snyder CF, Frick KD, Blackford AL, Herbert RJ, Neville BA, Carducci MA, Earle CC. How Does Initial Treatment Choice Affect Short-Term and Long-Term Costs for Clinically Localized Prostate Cancer? Cancer. 2010;116(23):5391–5399. - PubMed
    1. Etzioni RD. Review of evidence concerning PSA screening for prostate cancer has limitations as basis for policy development. Evidence-based medicine. 2013;18(2):75–6. - PubMed

Publication types