Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 10;12(1):2301.
doi: 10.1038/s41598-022-06274-9.

A non-invasive method for concurrent detection of early-stage women-specific cancers

Affiliations

A non-invasive method for concurrent detection of early-stage women-specific cancers

Ankur Gupta et al. Sci Rep. .

Abstract

We integrated untargeted serum metabolomics using high-resolution mass spectrometry with data analysis using machine learning algorithms to accurately detect early stages of the women specific cancers of breast, endometrium, cervix, and ovary across diverse age-groups and ethnicities. A two-step approach was employed wherein cancer-positive samples were first identified as a group. A second multi-class algorithm then helped to distinguish between the individual cancers of the group. The approach yielded high detection sensitivity and specificity, highlighting its utility for the development of multi-cancer detection tests especially for early-stage cancers.

PubMed Disclaimer

Conflict of interest statement

A.G, G.S., Z.S. and N.S. are fulltime employees of PredOmix Technologies Private Limited. K.V.S.R., R.A. and S.N. are cofounders and own stock in both PredOmix Technologies Private Limited and PredOmix, Inc. The work described in this report is the subject of an International PCT filing. Application No. PCT/US21/48337.

Figures

Figure 1
Figure 1
Ion chromatograms of representative samples from the normal control and the individual cancer groups. The total run time for the LC resolution was 14 min, with every sample run being alternated with a blank run. For the blank run involved injection of a 1:1 mixture of methanol and water. Comparatively, spectra in each case were following a trend with major changes seen from 200 to 600 m/z with the time ranging from 3 to 11 min.
Figure 2
Figure 2
Age-wise detection of detected metabolites. Figure provides a graphical representation of the number of metabolites detected across the individual age groups, for the normal control set as well as the individual cancer groups. The cumulative unique metabolites detected in normal control samples were 5895. While, endometrial, breast, cervical and ovarian cancer samples were found to have 5971, 5982, 6300 and 6336 respectively.
Figure 3
Figure 3
Data processing pipeline. The data preprocessing pipeline used to render the data amenable to AI modeling is depicted here (details are given in the text).
Figure 4
Figure 4
PLSDA plot distinguishes between the individual cancers and also the normal controls. Figure presents a PLSDA plot of the matrix of sample-specific metabolites versus metabolite intensity for normal controls and the individual women-specific cancer sets. The separation obtained between the individual groups is shown. The R2 and Q2 values obtained are given.
Figure 5
Figure 5
AI workflow for distinguishing BECO cancers from normal controls and its application. Panel A depicts the AI workflow employed to test the AI model for distinguishing between the women-specific cancer group (BECO) from the Normal controls. Panel B depicts the results from testing of the trained model for distinguishing women-specific cancers (BECO) from normal controls showing clear separation of disease. The separation achieved between the cancer and the control group is shown in the form of a confusion matrix, with the resulting sensitivity and specificity values also given.
Figure 6
Figure 6
Partitioning of training and test data sets for the multiclass AI model. (A) shows the segregation of the individual cancer sets for training and testing of the multiclass AI model-2 (see text) for distinguishing between the individual cancers of the BECO group.
Figure 7
Figure 7
Testing the multiclass model for its ability to distinguish the individual cancer groups. Panel (A) shows the results of specifically testing the multiclass trained model for separation of endometrial cancer samples from the other cancers (breast, cervical, ovarian) based on model’s Endometrial scores. The resulting confusion matrix on applying a threshold shows good accuracy, sensitivity and specificity. Panel (B) shows the results of specifically testing the multiclass trained model for separation of breast cancer samples from the other cancers (endometrial, cervical, and ovarian) based on model’s Breast scores. The resulting confusion matrix on applying a threshold shows good accuracy, sensitivity and specificity. Panel (C) the results of specifically testing the multiclass trained model for separation of cervical cancer samples from the other cancers (breast, endometrial, ovarian) based on model’s Cervical scores. The resulting confusion matrix on applying a threshold shows good accuracy, sensitivity and specificity. Panel (D) shows the results of specifically testing the multiclass trained model for separation of ovarian cancer samples from the other cancers (breast, endometrial, cervical) based on model’s Ovarian scores. The resulting confusion matrix on applying a threshold shows high accuracy, sensitivity and specificity.
Figure 8
Figure 8
Preparation and scheduling of QC and samples for UHPLC-MS/MS. A small aliquot of each sample (coloured cylinders) was pooled to create a QC sample (multi-coloured cylinder), which was then injected periodically (every 50th injection) throughout the batch run. Variability among consistently detected metabolites was used to estimate overall process and batch variability. Every sample injection was followed by a blank injection to prevent carryover between the sample runs.

References

    1. Dagenais GR, Leong DP, Rangarajan S, et al. Variations in common diseases, hospital admissions, and deaths in middle-aged adults in 21 countries from five continents (PURE): A prospective cohort study. Lancet. 2019;395:785–794. doi: 10.1016/S0140-6736(19)32007-0. - DOI - PubMed
    1. Fitzmaurice C, Akinyemiju TF, Al Lami FH, et al. Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 29 cancer groups, 1990 to 2016: A systematic analysis for the Global Burden of Disease study. JAMA Oncol. 2018;4:1553–1568. doi: 10.1001/jamaoncol.2018.2706. - DOI - PMC - PubMed
    1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2018;68:394–424. doi: 10.3322/caac.21492. - DOI - PubMed
    1. Torre LA, Islami F, Siegel RL, Ward EM, Jemal A. Global cancer in women: Burden and trends. Cancer Epidemiol. Biomark. Prev. 2017;26:444–457. doi: 10.1158/1055-9965.EPI-16-0858. - DOI - PubMed
    1. Sung H, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2021;71:209–249. doi: 10.3322/caac.21660. - DOI - PubMed

Substances