Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan-Dec;15(1):2226915.
doi: 10.1080/19490976.2023.2226915.

Machine learning model for predicting age in healthy individuals using age-related gut microbes and urine metabolites

Affiliations

Machine learning model for predicting age in healthy individuals using age-related gut microbes and urine metabolites

Seung-Ho Seo et al. Gut Microbes. 2023 Jan-Dec.

Abstract

Age-related gut microbes and urine metabolites were investigated in 568 healthy individuals using metataxonomics and metabolomics. The richness and evenness of the fecal microbiota significantly increased with age, and the abundance of 16 genera differed between the young and old groups. Additionally, 17 urine metabolites contributed to the differences between the young and old groups. Among the microbes that differed by age, Bacteroides and Prevotella 9 were confirmed to be correlated with some urine metabolites. The machine learning algorithm eXtreme gradient boosting (XGBoost) was shown to produce the best performing age predictors, with a mean absolute error of 5.48 years. The accuracy of the model improved to 4.93 years with the inclusion of urine metabolite data. This study shows that the gut microbiota and urine metabolic profiles can be used to predict the age of healthy individuals with relatively good accuracy.

Keywords: Age; prediction; urine; fecal; metabolomics; metataxonomics.

PubMed Disclaimer

Conflict of interest statement

No potential conflict of interest was reported by the author(s).

Figures

Figure 1.
Figure 1.
Analysis of gut microbial community profiles according to age. The samples were divided into three groups (young: 20–39 years, middle-aged: 40–59 years, and old: ≥60 years). (a) Relative abundance (%) of bacteria by ascending age groups at the phylum level. (b) Principal component analysis (PCA) of beta-diversity based on the operational taxonomic unit (OTU) level (unweighted UniFrac). (c) Alpha-diversities of microbial communities between young (yellow), middle-aged (blue), and old (red) groups (p value was calculated using a Kruskal–Wallis test; *p < .1, **p < .01, ***p < .001). (d, e) Violin plots of relative abundance of bacterial taxa (genus level) that contribute to differences in the linear discriminant analysis effect size (LEfSe) (LDA >2.0 and p < .05) between young and old groups. (d) Relatively higher abundance in the young group. (e) Relatively higher abundance in the old group. p value was calculated using a Kruskal–Wallis test; *p < .1, **p < .01, ***p < .001.
Figure 2.
Figure 2.
Orthogonal partial least squares discriminant analysis (OPLS-DA) score plot obtained from gas chromatography mass spectrometry (GC-MS) data of urine. The samples were divided into three groups (young: 20–39 years, middle-aged: 40–59 years, and old: ≥60 years). (a) OPLS-DA score plot of young (yellow), middle-aged (blue), and old (red) groups. (b) OPLS-DA score plot of young (yellow) and old (red) groups. Cross validation was performed using a permutation test that was repeated 200 times. No over-fitting was observed. (c-e) Violin plots of identified metabolites that contribute to differentiation between the young and old groups in the OPLS-DA model (VIP >1.0 and p < .05). (c) Relatively higher intensity in the young group. (d) Relatively higher intensity in the middle-aged group. (e) Relatively higher intensity in the old group. p value was calculated using a Kruskal–Wallis test; *p < .1, **p < .01, ***p < .001.
Figure 3.
Figure 3.
Data integration analysis for biomarker discovery using latent components (DIABLO) graphical outputs on the Korean age study. (a) Circos plot of correlations between urine metabolites and fecal microorganisms. Each dataset is assigned a different color: metabolites are in blue blocks and microorganisms in green blocks. Red and blue lines indicate positive and negative correlations between two variables, respectively (r ≥ |0.50|). (b) Network visualization of urine metabolites and fecal microorganisms derived from circos plots. Each dataset is assigned a different color: metabolites are in blue circles and microorganisms in green circles. Red and green lines indicate positive and negative correlations between two variables, respectively.
Figure 4.
Figure 4.
Linear regression scatter plot (Pearson correlation) of predicted age versus actual age based on gut microorganisms and urine metabolites (R2 = 0.79, p < .001). The x-axis shows the predicted age of the volunteers in years. The y-axis shows the actual age of the volunteers in years. Every blue dot displays one sample. The dotted line shows the linear correlation. R2 = coefficient of determination, MAE = mean absolute error in years.
Figure 5.
Figure 5.
Sample characteristics and workflow of young, middle-aged, and old groups.

Similar articles

Cited by

References

    1. Rae MJ, Butler RN, Campisi J, Grey ADNJ D, Finch CE, Gough M, Martin GM, Vijg J, Perrott KM, Logan BJ.. The demographic and biomedical case for late-life interventions in aging. Sci Transl Med. 2010;2(40):40cm21. doi:10.1126/scitranslmed.30008. - DOI - PMC - PubMed
    1. Jylhävä J, Pedersen NL, Hägg S. Biological age predictors. EBioMedicine. 2017;21:29–14. doi:10.1016/j.ebiom.2017.03.046. - DOI - PMC - PubMed
    1. Blackburn EH, Greider CW, Szostak JW. Telomeres and telomerase: the path from maize, Tetrahymena and yeast to human cancer and aging. Nat Med. 2006;12(10):1133–1138. doi:10.1038/nm1006-1133. - DOI - PubMed
    1. Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14(10):1–20. doi:10.1186/gb-2013-14-10-r115. - DOI - PMC - PubMed
    1. Peters MJ, Joehanes RJ, Pilling LC, Schurmann C, Conneely KN, Powell J, Reinmaa E, Sutphin GL, Zhernakova A, Schramm K, et al. The transcriptional landscape of age in human peripheral blood. Nat Commun. 2015;6(1):1–14. doi:10.1038/ncomms9570. - DOI - PMC - PubMed

Publication types

Substances

LinkOut - more resources