Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 8;6(1):105.
doi: 10.1038/s41525-021-00257-x.

The salivary metatranscriptome as an accurate diagnostic indicator of oral cancer

Affiliations

The salivary metatranscriptome as an accurate diagnostic indicator of oral cancer

Guruduth Banavar et al. NPJ Genom Med. .

Abstract

Despite advances in cancer treatment, the 5-year mortality rate for oral cancers (OC) is 40%, mainly due to the lack of early diagnostics. To advance early diagnostics for high-risk and average-risk populations, we developed and evaluated machine-learning (ML) classifiers using metatranscriptomic data from saliva samples (n = 433) collected from oral premalignant disorders (OPMD), OC patients (n = 71) and normal controls (n = 171). Our diagnostic classifiers yielded a receiver operating characteristics (ROC) area under the curve (AUC) up to 0.9, sensitivity up to 83% (92.3% for stage 1 cancer) and specificity up to 97.9%. Our metatranscriptomic signature incorporates both taxonomic and functional microbiome features, and reveals a number of taxa and functional pathways associated with OC. We demonstrate the potential clinical utility of an AI/ML model for diagnosing OC early, opening a new era of non-invasive diagnostics, enabling early intervention and improved patient outcomes.

PubMed Disclaimer

Conflict of interest statement

The following authors are/were employees of Viome Inc, a commercial for-profit company, at the time of their contributions: G.B., O.O., R.T., S.R., F.C., P.J.T., S.G., M.P., A.P., H.T., and M.V. For the other authors there is no conflict of interest to the best of our knowledge.

Figures

Fig. 1
Fig. 1. Descriptive statistics of salivary metatranscriptome of the high-risk population (Cohort A in Table 1).
a Species richness; control median 463, case median 415 and function richness; control median 2306, case median 2205. b Shannon diversity index; control mean 2.25, case mean 2.20; and Inverse Simpson diversity index; control mean 3.41, case mean 3.26. c Using Mann–Whitney U tests and at least twofold difference in means (0.69 in CLR space), 139 differentially expressed species (at p < 0.05) up- or downregulated (red and blue respectively) in cases relative to controls, organized by genus and phylum (median difference in CLR values); the size of the bubble is inversely proportional to the p value. d Using Mann–Whitney U tests and at least twofold difference in means (0.69 in CLR space), 49 differentially expressed KOs (at p < 0.05) up- or downregulated in cases relative to controls, organized by KEGG level-3 and level-2 functional groups; the size of each triangle is inversely proportional to its p value e Clustermap using Euclidean distance of CLR transformed sum(transcripts per million) data for active function (KO) features significant by Mann–Whitney U tests. Features are shown with corrected p values < 0.01 and median CLR differences between the cohorts of greater than 0 or less than −1. KOs are color coded by their KEGG level-3 functional group.
Fig. 2
Fig. 2. Predictive performance of machine-learnt classifier trained with discovery dataset (Cohort A in Table 1).
a Distribution of classifier output probabilities across the sample set. b Sensitivity & specificity tradeoff with 95% confidence interval computed using the Clopper-Pearson method; at the default decision boundary of 0.5, sensitivity is 0.81 and specificity is 0.85. c ROC AUC of the classifier using the LOOCV method is 0.87 (blue curve); using differentially expressed features only is 0.76 (orange curve). d Classifier probabilities separated by gender. e Classifier probabilities separated by smoking status. f PCA analysis using top 100 features (PC1 and PC2 capture 10.2% and 6.3% of the total variation, respectively.). g Probability of cancer output from the classifier for control samples with and without interference from chewing gum, chewing tobacco, and brushing teeth.
Fig. 3
Fig. 3. Oral metatranscriptomic signature from the ML classifier trained with Cohort A from Table 1.
Effect sizes (coefficient values within the classification model) of 101 active species (circles) and 247 active KOs (triangles), grouped into curated Viome Functional Categories (VFC), see ‘Supplementary Note 4’ section of the Supplementary Material; sizes of circles or triangles are proportional to the CLR median difference in expression level between cases and controls.

Similar articles

Cited by

  • Microbiota changes: the unseen players in cervical cancer progression.
    Fong Amaris WM, de Assumpção PP, Valadares LJ, Moreira FC. Fong Amaris WM, et al. Front Microbiol. 2024 Feb 8;15:1352778. doi: 10.3389/fmicb.2024.1352778. eCollection 2024. Front Microbiol. 2024. PMID: 38389527 Free PMC article. Review.
  • Noncoding RNAs in oral cancer.
    Balakittnen J, Weeramange CE, Wallace DF, Duijf PHG, Cristino AS, Kenny L, Vasani S, Punyadeera C. Balakittnen J, et al. Wiley Interdiscip Rev RNA. 2023 May-Jun;14(3):e1754. doi: 10.1002/wrna.1754. Epub 2022 Aug 12. Wiley Interdiscip Rev RNA. 2023. PMID: 35959932 Free PMC article. Review.
  • A review of machine learning methods for cancer characterization from microbiome data.
    Teixeira M, Silva F, Ferreira RM, Pereira T, Figueiredo C, Oliveira HP. Teixeira M, et al. NPJ Precis Oncol. 2024 May 30;8(1):123. doi: 10.1038/s41698-024-00617-7. NPJ Precis Oncol. 2024. PMID: 38816569 Free PMC article. Review.
  • Saliva-based microfluidic point-of-care diagnostic.
    Pittman TW, Decsi DB, Punyadeera C, Henry CS. Pittman TW, et al. Theranostics. 2023 Jan 31;13(3):1091-1108. doi: 10.7150/thno.78872. eCollection 2023. Theranostics. 2023. PMID: 36793864 Free PMC article. Review.
  • A salivary biomarker panel to detect liver cirrhosis.
    Trevisan França de Lima L, Crawford DHG, Broszczak DA, Zhang X, Bridle R K, Punyadeera C. Trevisan França de Lima L, et al. iScience. 2023 Jun 5;26(7):107015. doi: 10.1016/j.isci.2023.107015. eCollection 2023 Jul 21. iScience. 2023. PMID: 37360686 Free PMC article.

References

    1. American Society of Clinical Oncology. Head and neck cancer guide. https://www.cancer.net/cancer-types/head-and-neck-cancer/introduction (2019).
    1. World Health Organization. Oral cancer. https://www.who.int/cancer/prevention/diagnosis-screening/oral-cancer/en/ (2020).
    1. Brian L, et al. Changes in abundance of oral microbiota associated with oral cancer. PLoS ONE. 2014;9:e98741. - PMC - PubMed
    1. Dedhia RC, Smith KJ, Johnson JT, Roberts M. The cost-effectiveness of community-based screening for oral cancer in high-risk males in the United States: a Markov decision analysis approach. Laryngoscope. 2011;121:952–960. - PMC - PubMed
    1. Brocklehurst, P. et al. Screening programmes for the early detection and prevention of oral cancer. Cochrane Database Syst. Rev. 11 (2013). - PMC - PubMed