Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 20;15(1):30546.
doi: 10.1038/s41598-025-14580-1.

Unsupervised clustering of biochemical markers reveals health profiles associated with function and survival in active aging

Affiliations

Unsupervised clustering of biochemical markers reveals health profiles associated with function and survival in active aging

Raquel González-Martos et al. Sci Rep. .

Abstract

This study explores the relationships between biochemical phenotypes identified using machine learning, and key health outcomes, including body composition, physical function, and mortality risk. Data were collected from 536 physically active Spanish participants aged over 65 years (76.5% women) enrolled in the EXERNET cohort (2017-2018), with a 6-year mortality follow-up. Principal component analysis, and hierarchical and k-means clustering was used to identify distinct biochemical profiles. Associations between clusters and health outcomes were assessed using analysis of covariance and Cox proportional hazards models. Three distinct clusters emerged: 'Healthy', characterized by biochemical values within the normal range and used as the reference group; 'Metabolic', marked by dysregulated metabolic parameters; and 'Hepatic', which exhibited impaired liver function markers. Notably, all clusters showed subclinical levels of dysfunction. The 'Healthy Cluster' demonstrated the highest levels of organized physical activity (90%, p < 0.001), whereas the 'Metabolic Cluster' showed poorer body composition and reduced physical performance. Both the 'Metabolic' and 'Hepatic' clusters demonstrated a higher mortality risk, as confirmed through Cox regression analyses. Adjusted hazard ratios were significantly elevated when considering physical activity and adiposity, with values of 3.45 and 3.71 for the 'Metabolic Cluster', and 3.01 and 3.85 for the 'Hepatic Cluster' (p < 0.05). This study underscores the strong link between metabolic health, physical activity, body composition and 6-years mortality risk in older adults. Machine learning techniques for identifying phenotypic clusters offers a promising tool for early detection and targeted interventions to improve aging outcomes.

Keywords: Aging; Biochemical phenotypes; Body composition; Cluster analysis; Mortality risk; Physical function.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flowchart of participants included in the analysis.
Fig. 2
Fig. 2
Principal Component Analysis (PCA) heatmap of the blood test profile. The plot represents the contribution of variables onto the principal components which explain X% (First component) and Y% (Second component) of the total variance, respectively. The colour of the points reflects the contribution of each variable to the PCA. K: potassium, Ca: calcium, ALT: alanine aminotransferase, GGT: gamma glutamyltransferase, LDH: lactate dehydrogenase, VLDL: very low-density lipoproteins, TyG index: triglycerides/glucose index, MCV: Mean Corpuscular Volume, MHC: Mean Corpuscular Hemoglobin, MCHC: Mean Corpuscular Hemoglobin Concentration, MPV: Mean Platelet Volume.
Fig. 3
Fig. 3
Biochemical profiles of three phenotypes across four parameter categories. The figure shows the biochemical profiles of the ‘Healthy Cluster’, ‘Hepatic Cluster’, and ‘Metabolic Cluster’ across four categories: (a) metabolic, (b) renal, (c) immunological, and (d) hepatic parameters. The X-axis represents the parameters within each category, and the Y-axis displays the scaled values (− 1 to 1) to enable comparison across clusters. VLDL: very low-density lipoproteins, TyG index: triglycerides/glucose index, K: potassium, Ca: calcium, ALT: alanine aminotransferase, GGT: gamma glutamyl transferase, LDH: lactate dehydrogenase.
Fig. 4
Fig. 4
Survival curves for each biochemical phenotype and gender, and Cox hazard ratios for influencing variables. The figure shows: (a) survival curves for the ‘Healthy’, ‘Metabolic’ and ‘Hepatic’ clusters, (b) by sex, and (c) Cox proportional hazard plot showing the log of the hazard ratio (HR) and 95% confidence intervals (CI) for variables influencing mortality, highlighting significant risk factors for mortality across groups. BMI: Body mass index.

References

    1. United Nations Department of Economic and Social Affairs. World Population Ageing 2019. (2019).
    1. Fernández-Ballesteros, R., Sánchez-Izquierdo, M. & Santacreu, M. Active Aging and Quality of Life. In Handbook of Active Ageing and Quality of Life: From Concepts to Applications (eds Rojo-Pérez, F. & Fernández-Mayoralas, G.) 15–42 (Springer International Publishing, 2021).
    1. You, Y. et al. Inverted U-shaped relationship between sleep duration and phenotypic age in US adults: A population-based study. Sci. Rep.14(1), 1–12. 10.1038/s41598-024-56316-7 (2024). - PMC - PubMed
    1. You, Y. et al. Mediation role of telomere length in the relationship between physical activity and PhenoAge: A population-based study. J. Exerc. Sci. Fit.23(3), 149–156. 10.1016/j.jesf.2025.03.004 (2025). - PMC - PubMed
    1. Zemedikun, D. T., Gray, L. J., Khunti, K., Davies, M. J. & Dhalwani, N. N. Patterns of multimorbidity in middle-aged and older adults: An analysis of the UK biobank data. Mayo Clin. Proc.93(7), 857–866. 10.1016/j.mayocp.2018.02.012 (2018). - PubMed