Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 7;16(19):8394-8404.
doi: 10.1039/d5sc01526j. eCollection 2025 May 14.

Machine-learning based classification of 2D-IR liquid biopsies enables stratification of melanoma relapse risk

Affiliations

Machine-learning based classification of 2D-IR liquid biopsies enables stratification of melanoma relapse risk

Kelly Brown et al. Chem Sci. .

Abstract

Non-linear laser spectroscopy methods such as two-dimensional infrared (2D-IR) produce large, information-rich datasets, while developments in laser technology have brought substantial increases in data collection rates. This combination of data depth and quantity creates the opportunity to unite advanced data science approaches, such as Machine Learning (ML), with 2D-IR to reveal insights that surpass those from established data interpretation methods. To demonstrate this, we show that ML and 2D-IR spectroscopy can classify blood serum samples collected from patients with melanoma according to diagnostically-relevant groupings. Using just 20 μL samples, 2D-IR measures 'protein amide I fingerprints', which reflect the protein profile of blood serum. A hyphenated Partial Least Squares-Support Vector Machine (PLS-SVM) model was able to classify 2D-protein fingerprints taken from 40 patients with melanoma according to the presence, absence or later development of metastatic disease. Area under the receiver operating characteristic curve (AUROC) values of 0.75 and 0.86 were obtained when identifying samples from patients who were radiologically cancer free and with metastatic disease respectively. The model was also able to classify (AUROC = 0.80) samples from a third group of patients who were radiologically cancer-free at the point of testing but would go on to develop metastatic disease within five years. This ability to identify post-treatment patients at higher risk of relapse from a spectroscopic measurement of biofluid protein content shows the potential for hybrid 2D-IR-ML analyses and raises the prospect of a new route to an optical blood-based test capable of risk stratification for melanoma patients.

PubMed Disclaimer

Conflict of interest statement

There are no conflicts to declare.

Figures

Fig. 1
Fig. 1. Average 2D-IR spectra obtained with Tw = 250 fs for each of the different patient classification groups (a) control (Cont, 48 spectra), (b) developed metastasis (Dev, 33 spectra) and (c) metastatic (Met, 63 spectra). Difference spectra determined for subtraction of the averaged spectra of the three classes from each other are shown in panels (d) metastatic – control, (e) developed metastasis – control and (f) metastatic – developed metastasis. (g) to (i) show the difference spectra from (d)–(f) expanded by the multiplication factor shown.
Fig. 2
Fig. 2. Average performance metric outputs across outer folds for test samples, (a) average confusion matrix, (b) average ROC curves for each class: control (green), developed metastasis (gold) and metastatic (pink) (c) prediction probability box plots across all outer folds for each class, with jitter points added showing each individual probability value and range. Clear separation of the target category from the others show the model's confidence in producing a classification.
Fig. 3
Fig. 3. (a) Average variable importance of projection scores across training set of outer folds. VIP scores greater than 1 indicate important latent variables used in making model predictions. Panels (b) to (d) shows the spectral loadings of the three variables considered to be the most important, (b) = LV 15, (c) = LV 13, (d) = LV 14. Panels (e–g) reproduce the difference spectra from Fig. 1(g–i) for comparison. Coloured arrows highlight points of interest as discussed in the text.

References

    1. Hamm P. and Zanni M. T., Concepts And Method Of 2D Infrared Spectroscopy, Cambridge University Press, Cambridge, 2011
    1. Khalil M. Demirdoven N. Tokmakoff A. J. Phys. Chem. A. 2003;107:5258–5279. doi: 10.1021/jp0219247. - DOI
    1. Chung J. K. Thielges M. C. Fayer M. D. Proc. Natl. Acad. Sci. U. S. A. 2011;108:3578–3583. doi: 10.1073/pnas.1100587108. - DOI - PMC - PubMed
    1. Ishikawa H. Kwak K. Chung J. K. Kim S. Fayer M. D. Proc. Natl. Acad. Sci. U. S. A. 2008;105:8619–8624. doi: 10.1073/pnas.0803764105. - DOI - PMC - PubMed
    1. Ishikawa H. Finkelstein I. J. Kim S. Kwak K. Chung J. K. Wakasugi K. Massari A. M. Fayer M. D. Proc. Natl. Acad. Sci. U. S. A. 2007;104:16116–16121. doi: 10.1073/pnas.0707718104. - DOI - PMC - PubMed

LinkOut - more resources