Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 May;8(5):1021-33.
doi: 10.18632/aging.100968.

Deep biomarkers of human aging: Application of deep neural networks to biomarker development

Affiliations

Deep biomarkers of human aging: Application of deep neural networks to biomarker development

Evgeny Putin et al. Aging (Albany NY). 2016 May.

Abstract

One of the major impediments in human aging research is the absence of a comprehensive and actionable set of biomarkers that may be targeted and measured to track the effectiveness of therapeutic interventions. In this study, we designed a modular ensemble of 21 deep neural networks (DNNs) of varying depth, structure and optimization to predict human chronological age using a basic blood test. To train the DNNs, we used over 60,000 samples from common blood biochemistry and cell count tests from routine health exams performed by a single laboratory and linked to chronological age and sex. The best performing DNN in the ensemble demonstrated 81.5 % epsilon-accuracy r = 0.90 with R(2) = 0.80 and MAE = 6.07 years in predicting chronological age within a 10 year frame, while the entire ensemble achieved 83.5% epsilon-accuracy r = 0.91 with R(2) = 0.82 and MAE = 5.55 years. The ensemble also identified the 5 most important markers for predicting human chronological age: albumin, glucose, alkaline phosphatase, urea and erythrocytes. To allow for public testing and evaluate real-life performance of the predictor, we developed an online system available at http://www.aging.ai. The ensemble approach may facilitate integration of multi-modal data linked to chronological age and sex that may lead to simple, minimally invasive, and affordable methods of tracking integrated biomarkers of aging in humans and performing cross-species feature importance analysis.

Keywords: aging biomarkers; biomarker development; deep learning; deep neural networks; human aging; machine learning.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest statement

The authors are affiliated with Insilico Medicine, Inc, a commercial company developing differential pathway activation scoring-based and deep learned biomarkers of multiple diseases and aging and engaging in drug discovery and drug repurposing. The company has developed a range of drug candidates addressing specific diseases and geroprotector interventions addressing human aging processes that need to be validated in human patients. The company intends to use blood biochemistry and multi-parametric markers, including the one published in this paper to test the efficacy of these compounds. Despite company's commitment to best academic practices and in silico veritas, the authors may have a conflict of interest.

Figures

Figure 1
Figure 1. Project pipeline
Laboratory blood biochemistry data sets were normalized and cleaned of outliers and some abnormal markers. For biological age prediction, 21 different DNNs with different parameters were combined in ensemble based on ElasticNet model. For biological sex prediction, single DNN were trained.
Figure 2
Figure 2. Analysis of best DNN model in the ensemble and the whole ensemble
(A) Correlation between actual and predicted age values by the best DNN in the ensemble. (B) Biological age epsilon-prediction accuracy plot for the best DNN. (C) Biological age marker Importance, performed using FPI method. (D) Correlation between actual and predicted age values by whole ensemble based on ElasticNet model. (E) Biological age epsilon-prediction accuracy plot for the ensemble. (F) Heat map for Pearson's correlation coefficients between 40 DNNs. Scale bar colors indicate the sign and magnitude of Pearson's correlation coefficient between predictions of DNNs.
Figure 3
Figure 3. DNNs outperform baseline ML approaches in terms of R2 statistics
DNN were compared with 7 ML techniques: GBM (Gradient Boosting Machine), RF (Random Forests), DT (Decision Trees), LR (Linear Regression), kNN (k-Nearest Neighbors), ElasticNet, SVM (Support Vector Machines). (A) GBM shows the higher 0,72 R2 among ML models for biological age prediction. (B) All ML models have comparable high R2 for biological sex prediction.
Figure 4
Figure 4. Comparison of sub-models for stacking ensemble and evaluation of filling strategies
(A) ElasticNet model has the higher epsilon-prediction accuracy among the stacking models. (B) ElasticNet is the best model for stacking from the point of R2 statistics. (C) Median filling strategy has higher epsilon-prediction accuracy than other strategies. Median filling strategy shows 64,5 % epsilon accuracy within 10 years frame. (D) Median filling strategy is better from the point of R2 statistics.
Figure 5
Figure 5. Top features analysis
(A) Dependence of the epsilon-prediction accuracy from the number of features. (B) Dependence of R2 statistics from the number of features.

Comment in

  • Deep biomarkers of aging are population-dependent.
    Cohen AA, Morissette-Thomas V, Ferrucci L, Fried LP. Cohen AA, et al. Aging (Albany NY). 2016 Sep 8;8(9):2253-2255. doi: 10.18632/aging.101034. Aging (Albany NY). 2016. PMID: 27622833 Free PMC article. No abstract available.

References

    1. Zhavoronkov A, Cantor CR. Methods for structuring scientific knowledge from many areas related to aging research. PLoS One. 2011;6:e22597. - PMC - PubMed
    1. Moskalev A, Zhikrivetskaya S, Shaposhnikov M, Dobrovolskaya E, Gurinovich R, Kuryan O, et al. Aging Chart: a community resource for rapid exploratory pathway analysis of age-related processes. Nucleic Acids Res. 2016;44:D894–899. - PMC - PubMed
    1. Moskalev A, Chernyagina E, de Magalhães JP, Barardo D, Thoppil H, Shaposhnikov M, et al. Geroprotectors.org: a new, structured and curated database of current therapeutic interventions in aging and age-related disease. Aging (Albany NY) 2015;7:616–728. - PMC - PubMed
    1. Zhavoronkov A, Alex Z, Bhupinder B. Classifying Aging as a Disease in the context of ICD-1. Frontiers in Genetics. 2015;6:1–16. - PMC - PubMed
    1. Bürkle A, Moreno-Villanueva M, Bernhard J, Blasco M, Zondag G, Hoeijmakers JHJ, et al. MARK-AGE biomarkers of ageing. Mech. Ageing Dev. 2015;151:2–12. - PubMed

Publication types