Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul 29:2:697254.
doi: 10.3389/fragi.2021.697254. eCollection 2021.

Adapting Blood DNA Methylation Aging Clocks for Use in Saliva Samples With Cell-type Deconvolution

Affiliations

Adapting Blood DNA Methylation Aging Clocks for Use in Saliva Samples With Cell-type Deconvolution

Fedor Galkin et al. Front Aging. .

Abstract

DeepMAge is a deep-learning DNA methylation aging clock that measures the organismal pace of aging with the information from human epigenetic profiles. In blood samples, DeepMAge can predict chronological age within a 2.8 years error margin, but in saliva samples, its performance is drastically reduced since aging clocks are restricted by the training set domain. However, saliva is an attractive fluid for genomic studies due to its availability, compared to other tissues, including blood. In this article, we display how cell type deconvolution and elastic net can be used to expand the domain of deep aging clocks to other tissues. Using our approach, DeepMAge's error in saliva samples was reduced from 20.9 to 4.7 years with no retraining.

Keywords: DNA methylation; aging; aging clock; biogerontology; cell-type deconvolution; deep learning; domain adaptation; epigenetics.

PubMed Disclaimer

Conflict of interest statement

FG, KK, AZ, PM were employed by Deep Longevity Limited - part of Endurance RP Limited (SEHK: 0575.HK). AZ was employed by Insilico Medicine Hong Kong Limited. DeepMAge is a patent-pending aging clock.

Figures

FIGURE 1
FIGURE 1
Publicly available studies of DNAm in saliva samples have high inter- and intra-study cell composition variability. (A) Average cell composition of saliva samples in the studies used, estimated with EpiDISH DNAm deconvolution. N = Total number of samples in a study. (B) Immune cell distributions within the studies used. Boxes correspond to the IQR, whiskers protrude no further than 1.5×IQR. Boxes are colored according to their inclusion in the training or verification set.
FIGURE 2
FIGURE 2
Cell-type adjustment (“total age-transformed”) significantly improves the performance of DeepMAge (originally developed for blood samples) in the domain of saliva samples. (A) Training set, unadjusted: R 2 = 0.73. (B) Training set, adjusted: R 2 = 0.95. (C) Verification set, unadjusted: R 2 = 0.63. (D) Verification set, adjusted: R 2 = 0.92. R 2 = coefficient of determination. Predictions for the training set were obtained with leave-one-study-out cross-validation.
FIGURE 3
FIGURE 3
The reported adjustment significantly increases the accuracy of age prediction in all studies, despite their different cell-type composition (Figure 1). (A) Prior to the adjustment DeepMAge Blood predicts chronological age in saliva samples with a MAE of 26.36 years in the training set and 20.86 years in the verification set (Table 1). Numbers above boxes stand for the number of samples in each study. (B) After the “total age-transformed” adjustment DeepMAge’s error in saliva samples drops to a MAE of 4.57 years in the training set and 4.74 years in the verification set, despite the variable cell type composition of the samples across studies (Figure 1A). Boxes correspond to the interquartile region (IQR), whiskers protrude no further than 1.5×IQR. Boxes are colored according to their inclusion in the training or verification set.
FIGURE 4
FIGURE 4
The “total age-transformed” adjustment increases the age prediction accuracy in individual samples by up to 50 years in saliva samples. The absolute increase in accuracy is smaller in samples with higher immune cell count. Very few (5%) samples are predicted less accurately (left to the red vertical line) after the adjustment. Black line is the ordinary least squares regression, its R 2 is 0.72 and Pearson’s r is −0.85.

Similar articles

Cited by

References

    1. Aps J. K. M., Van den Maagdenberg K., Delanghe J. R., Martens L. C. (2002). Flow Cytometry as a New Method to Quantify the Cellular Content of Human Saliva and its Relation to Gingivitis. Clinica Chim. Acta Int. J. Clin. Chem. 321, 35–41. 10.1016/s0009-8981(02)00062-1 - DOI - PubMed
    1. Bischof E., Scheibye-Knudsen M., Siow R., Moskalev A. (2021). Longevity Medicine: Upskilling the Physicians of Tomorrow. The Lancet Healthy Longevity 2, e187–e188. 10.1016/S2666-7568(21)00024-6 - DOI - PubMed
    1. Bocklandt S., Lin W., Sehl M. E., Sánchez F. J., Sinsheimer J. S., Horvath S., et al. (2011). Epigenetic Predictor of Age. PLoS ONE 6–e14821. 10.1371/journal.pone.0014821 - DOI - PMC - PubMed
    1. Bryan A. D., Magnan R. E., Hooper A. E. C., Harlaar N., Hutchison K. E. (2013). Physical Activity and Differential Methylation of Breast Cancer Genes Assayed from Saliva: a Preliminary Investigation. Ann. Behav. Med. A Publ. Soc. Behav. Med. 45, 89–98. 10.1007/s12160-012-9411-4 - DOI - PMC - PubMed
    1. Chicco D. (2017). Ten Quick Tips for Machine Learning in Computational Biology. BioData Mining 10, 35. 10.1186/s13040-017-0155-3 - DOI - PMC - PubMed

LinkOut - more resources