. 2024 Nov;7(11):e70042.

doi: 10.1002/cnr2.70042.

Development of a Serum Metabolome-Based Test for Early-Stage Detection of Multiple Cancers

Rajnish Nagarkar¹, Mamillapalli Gopichand², Suparna Kanti Pal^{3

4}, Ankur Gupta^{5

6}, Najmuddin Md Saquib^{5

6}, Ahmad Ahmad⁵, Ganga Sagar⁵, Kanury V S Rao^{5

6}, Zaved Siddiqui^{5

6}, Imliwati Longkumer⁷

Affiliations

¹ HCG Manavta Cancer Centre, Nashik, Maharashtra, India.
² HealthCare Global Enterprises Limited, City Cancer Centre, Vijayawada, India.
³ Indoriv Clinical, Kolkata, India.
⁴ Department of Radiotherapy, IPGME&R, Kolkata, India.
⁵ PredOmix Technologies Private Limited, Gurugram, India.
⁶ PredOmix Health Sciences Private Limited, Singapore, Singapore.
⁷ North East Cancer Hospital and Research Institute, Jorabat, Guwahati, Assam, India.

PMID: 39559978
PMCID: PMC11574562
DOI: 10.1002/cnr2.70042

Development of a Serum Metabolome-Based Test for Early-Stage Detection of Multiple Cancers

Rajnish Nagarkar et al. Cancer Rep (Hoboken). 2024 Nov.

. 2024 Nov;7(11):e70042.

doi: 10.1002/cnr2.70042.

Authors

Affiliations

¹ HCG Manavta Cancer Centre, Nashik, Maharashtra, India.
² HealthCare Global Enterprises Limited, City Cancer Centre, Vijayawada, India.
³ Indoriv Clinical, Kolkata, India.
⁴ Department of Radiotherapy, IPGME&R, Kolkata, India.
⁵ PredOmix Technologies Private Limited, Gurugram, India.
⁶ PredOmix Health Sciences Private Limited, Singapore, Singapore.
⁷ North East Cancer Hospital and Research Institute, Jorabat, Guwahati, Assam, India.

PMID: 39559978
PMCID: PMC11574562
DOI: 10.1002/cnr2.70042

Abstract

Background: Detection of cancer at the early stage currently offers the only viable strategy for reducing disease-related morbidity and mortality. Various approaches for multi-cancer early detection are being explored, which largely rely on capturing signals from circulating analytes shed by tumors into the blood. The fact that biomarker concentrations are limiting in the early stages of cancer, however, compromises the accuracy of these tests. We, therefore, adopted an alternate approach that involved interrogation of the serum metabolome with machine learning-based data analytics. Here, we monitored for modulations in metabolite patterns that correlated with the presence or absence of cancer. Results obtained confirmed the efficacy of this approach by demonstrating that it could detect a total of 15 cancers in women with an average accuracy of about 99%.

Aims: To further increase the scope of our test, we conducted an investigator-initiated clinical trial involving a total of 6445 study participants, which included both cancer patients and non-cancer volunteers. Our goal here was to maximize the number of cancers that could be detected, while also covering cancers in both females and males.

Methods and results: Metabolites extracted from individual serum samples were profiled by ultra-performance liquid chromatography coupled to a high-resolution mass spectrometer using an untargeted protocol. After processing, the data were analyzed by our cancer detection machine-learning algorithm to differentiate cancer from non-cancer samples. Results revealed that our test platform could indeed detect a total of 30 cancers, covering both females and males, with an average accuracy of ~98%. Importantly, the high detection accuracy remained invariant across all four stages of the cancers.

Conclusion: Thus, our approach of integrating untargeted metabolomics with machine learning-powered data analytics offers a powerful strategy for early-stage multi-cancer detection with high accuracy.

Trial registration: Registration No: CTRI/2023/03/050316.

Keywords: artificial intelligence; cancer; metabolomics.

PubMed Disclaimer

Conflict of interest statement

A.G, G.S., Z.S., N.M.S., A.A., and K.V.S.R. are fulltime employees of PredOmix Technologies Private Limited. K.V.S.R. is a cofounder and owns stock in both PredOmix Technologies Private Limited and PredOmix Health Sciences Pte. Ltd. A.G., Z.S., and N.M.S. own stock in PredOmix Health Sciences Pte. Ltd. The work described in this report is included in the Patent Cooperation Filing. International application No. PCT/SG2024/050022. The other authors declare no conflicts of interest.

Figures

**FIGURE 1**
Pipeline for data processing and development of the CDAI algorithm. Figure illustrates the workflow for development of the CDAI model, which also includes the data pre‐processing steps employed. A detailed explanation of the individual steps is provided in the Section 2.

**FIGURE 2**
k‐fold cross validation of the CDAI algorithm. To cross‐validate the model we used the k (k = 20) fold cross validation where the train test dataset was split into 20 parts. Figure shows the results for test sensitivities for each fold along with their 95% CI. The blue line shows the average sensitivity of the CDAI model.

**FIGURE 3**
Differentiating cancer samples from normal controls using the CDAI algorithm. (A) This panel shows the results obtained for normal samples, and the samples from each of the cancer types tested in the validation set, using the CDAI algorithm. A scatter plot depicting the distribution of y‐scores for individual samples either in the normal control group, or in each of the 30 cancer types tested, is presented here. Whereas the normal control group is denoted as Noc, individual cancer types are abbreviated as Br, breast cancer; Endo, endometrial cancer; Cv, cervical cancer; Ov, ovarian cancer; Lung, lung cancer; AML, acute myeloid leukemia; Thy, thyroid cancer; Mel, melanoma, Colo, colorectal cancer; Kdy, kidney cancer; NHL, Non‐Hodgkin's lymphoma; Panc, pancreatic cancer; Liv, liver and bile duct cancer; Gst, gastric cancer, H&N, head and neck cancer; Osp, oesophageal cancer; Bld, bladder cancer; Brn, brain and CNS cancer; MMY, multiple myeloma; GB, gall bladder cancer; Sarc, sarcoma; Prost, prostate cancer; Tstr, testicular cancer; Vulval; vulval cancer; Anal, anal cancer; Vgnl, vaginal cancer; Penile, penile cancer; Unk. Pr, cancer of unknown primary origin; GCT, germ cell tumors; SCC, squamous cell carcinoma. Panel (B) gives the resultant confusion matrix obtained from this data, after taking a y‐score of 0 as the threshold score for distinguishing cancer‐positive samples from normal controls. Values obtained for sensitivity and specificity are also given. The ROC‐AUC probability curve plot, as an indicator of CDAI performance, is shown in panel (C). The area under the curve (AUC) obtained was 0.99.

**FIGURE 4**
High fidelity performance of the CDAI is maintained across cancer types, subject age groups, and cancer stages. Panel (A) gives the sensitivity of cancer detection achieved by the CDAI for the validation subset of samples in each of the 30 cancer types tested. While the abbreviation for the cancers used here is the same as than in Figure 3A, the number of cancer‐positive samples identified, against the total number of samples tested, for each of the cancer types is included in parenthesis along the X‐axis. Panel (B) gives the CDAI cancer‐detection sensitivity obtained as a function of the age‐group distribution of the cancer patients while panel (C) shows a plot of detection sensitivity versus stage of the cancer. In both the latter panels the number of samples identified as cancer positive by the CDAI, versus the total number of validation set‐samples tested in each sub‐group is given in parenthesis along the X‐axis. For all panels, bars indicate 95% CI.

**FIGURE 5**
Stage‐specific CDAI test performance for representative cancers. Stage‐specific sensitivities obtained using the CDAI on the validation subset of 12 representative cancers are shown here. The number of samples identified as cancer positive, against the total number of samples tested at each stage of each of the cancers is also included in parenthesis and the bars indicate 95% CI.

See this image and copyright information in PMC

References

1. Bray F., Laversanne M., Weiderpass E., and Soerjomataram I., “The Ever‐Increasing Importance of Cancer as a Leading Cause of Premature Death Worldwide,” Cancer 127 (2021): 3029–3030. - PubMed
1. Siegel R. L., Miller K. D., Fuchs H. E., and Jemal A., “Cancer Statistics, 2021,” CA: A Cancer Journal for Clinicians 71, no. 1 (2021): 7–33. - PubMed
1. Zugazagoitia J., Guedes C., Ponce S., Ferrer I., Molina‐Pinelo S., and Paz‐Ares L., “Current Challenges in Cancer Treatment,” Clinical Therapeutics 38 (2016): 1551–1566. - PubMed
1. Zappa C. and Mousa S. A., “Non‐small Cell Lung Cancer: Current Treatment and Future Advances,” Translational Lung Cancer Research 5, no. 3 (2016): 288–300. - PMC - PubMed
1. The American Cancer Society , “Key Statistics About Stomach Cancer,” (2020).

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
- PubMed Central
- Wiley
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Development of a Serum Metabolome-Based Test for Early-Stage Detection of Multiple Cancers

Affiliations

Development of a Serum Metabolome-Based Test for Early-Stage Detection of Multiple Cancers

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Medical