Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug;2(2):33-44.
doi: 10.1002/lci2.25. Epub 2021 May 20.

Salivary Metabolites are Promising Non-Invasive Biomarkers of Hepatocellular Carcinoma and Chronic Liver Disease

Affiliations

Salivary Metabolites are Promising Non-Invasive Biomarkers of Hepatocellular Carcinoma and Chronic Liver Disease

Courtney E Hershberger et al. Liver Cancer Int. 2021 Aug.

Abstract

Background: Hepatocellular carcinoma (HCC) is a leading causes of cancer mortality worldwide. Improved tools are needed for detecting HCC so that treatment can begin as early as possible. Current diagnostic approaches and existing biomarkers, such as alpha-fetoprotein (AFP) lack sensitivity, resulting in too many false negative diagnoses. Machine-learning may be able to identify combinations of biomarkers that provide more robust predictions and improve sensitivity for detecting HCC. We sought to evaluate whether metabolites in patient saliva could distinguish those with HCC, cirrhosis, and those with no documented liver disease.

Methods and results: We tested 125 salivary metabolites from 110 individuals (43 healthy, 37 HCC, 30 cirrhosis) and identified 4 metabolites that displayed significantly different abundance between groups (FDR P <.2). We also developed four tree-based, machine-learning models, optimized to include different numbers of metabolites, that were trained using cross-validation on 99 patients and validated on a withheld test set of 11 patients. A model using 12 metabolites -octadecanol, acetophenone, lauric acid, 1-monopalmitin, dodecanol, salicylaldehyde, glycyl-proline, 1-monostearin, creatinine, glutamine, serine and 4-hydroxybutyric acid- had a cross-validated sensitivity of 84.8%, specificity of 92.4% and correctly classified 90% of the HCC patients in the test cohort. This model outperformed previously reported sensitivities and specificities for AFP (20-100ng/ml) (61%, 86%) and AFP plus ultrasound (62%, 88%).

Conclusions and impact: Metabolites detectable in saliva may represent products of disease pathology or a breakdown in liver function. Notably, combinations of salivary metabolites derived from machine-learning may serve as promising non-invasive biomarkers for the detection of HCC.

Keywords: Metabolomics; cirrhosis; liver cancer; machine learnings; risk factor.

PubMed Disclaimer

Conflict of interest statement

Competing Interests D.M.R. has an equity stake in Interpares Biomedicine, LLC. D.M.R., F.A., D.S.A hold intellectual property related to the detection of hepatocellular carcinoma.

Figures

Figure 1:
Figure 1:
Workflow diagram for data collection, processing, analysis and generation of predictive models for disease state classification using metabolite relative abundance.
Figure 2:
Figure 2:. Eight metabolites differ between patient cohorts.
a) Volcano plot depicting false discovery rate (FDR) and Log2 Fold Change (Log2 FC) derived for all metabolites in pair-wise comparisons of disease status, adjusted for differences in age and sex. Metabolites with an FDR P<.2 (dotted red line) are highlighted. b) Box plots displaying distribution of relative abundance stratified by disease status for significantly differing metabolites in at least one comparison (FDR P < .2, adjusted for age and sex).
Figure 3:
Figure 3:. Random Forest model predicts disease status from metabolite abundance.
a) An iterative random forest (iRF) approach was used, whereby RF models were generated after iteratively removing the metabolite with the lowest mean Gini score within a leave-one-out cross-validation (LOOCV) framework. The range (min, mean, max) of OOB error across the models is displayed. The model including 125 metabolites is shown in red (RF125), the model including 12 metabolites is shown in blue (iRF12) and the model including 4 metabolites is shown in red (iRF4). b) The range of Gini scores (minimum, mean, maximum) across all metabolites in model RF125. Red coloring indicates the selected metabolites for the iRF models. c) The range of Gini scores of metabolites included in the iRF8. d) The range of Gini scores of metabolites included in iRF4.
Figure 4:
Figure 4:. Classification of disease status predicted by decision tree model.
a) A decision tree model based on selected metabolites from the iterative random forest (iRF12) approach optimized with a classification accuracy of 86%. Colored squares indicate the disease status of each individual by disease status at each branch of the decision tree. b) Comparison of accuracy metrics from RF with all metabolites (RF125), iRFs with selected metabolites (iRF12, iRF4), and the decision tree models (DT) using leave-one-out cross-validation.

Similar articles

Cited by

References

    1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. United States; 2020;70:7–30. - PubMed
    1. Petrick JL, Kelly SP, Altekruse SF, McGlynn KA, Rosenberg PS. Future of Hepatocellular Carcinoma Incidence in the United States Forecast Through 2030. J Clin Oncol Off J Am Soc Clin Oncol 2016;34:1787–94. - PMC - PubMed
    1. Forner A, Reig M, Bruix J. Hepatocellular carcinoma. Lancet Lond Engl. England; 2018;391:1301–14. - PubMed
    1. Roerecke M, Vafaei A, Hasan OSM, Chrystoja BR, Cruz M, Lee R, et al.Alcohol Consumption and Risk of Liver Cirrhosis: A Systematic Review and Meta-Analysis. Am J Gastroenterol 2019;114:1574–86. - PMC - PubMed
    1. Sun C-A, Wu D-M, Lin C-C, Lu S-N, You S-L, Wang L-Y, et al.Incidence and cofactors of hepatitis C virus-related hepatocellular carcinoma: a prospective study of 12,008 men in Taiwan. Am J Epidemiol. United States; 2003;157:674–82. - PubMed