Data-driven clustering approach to identify novel clusters of high cognitive impairment risk among Chinese community-dwelling elderly people with normal cognition: A national cohort study
- PMID: 38638099
- PMCID: PMC11026990
- DOI: 10.7189/jogh.14.04088
Data-driven clustering approach to identify novel clusters of high cognitive impairment risk among Chinese community-dwelling elderly people with normal cognition: A national cohort study
Abstract
Background: Cognitive impairment is a highly heterogeneous disorder that necessitates further investigation into the distinct characteristics of populations at varying risk levels of cognitive impairment. Using a large-scale registry cohort of elderly individuals, we applied a data-driven approach to identify novel clusters based on diverse sociodemographic features.
Methods: A prospective cohort of 6398 elderly people from the Chinese Longitudinal Healthy Longevity Survey, followed between 2008-14, was used to develop and validate the model. Participants were aged ≥60 years, community-dwelling, and the Chinese version of the Mini-Mental State Examination (MMSE) score ≥18 were included. Sixty-nine sociodemographic features were included in the analysis. The total population was divided into two-thirds for the derivation cohort (n = 4265) and one-third for the validation cohort (n = 2133). In the derivation cohort, an unsupervised Gaussian mixture model was applied to categorise participants into distinct clusters. A classifier was developed based on the most important 10 factors and was applied to categorise participants into their corresponding clusters in a validation cohort. The difference in the three-year risk of cognitive impairment was compared across the clusters.
Results: We identified four clusters with distinct features in the derivation cohort. Cluster 1 was associated with the worst life independence, longest sleep duration, and the oldest age. Cluster 2 demonstrated the highest loneliness, characterised by non-marital status and living alone. Cluster 3 was characterised by the lowest sense of loneliness and the highest proportions in marital status and family co-residence. Cluster 4 demonstrated heightened engagement in exercise and leisure activity, along with independent decision-making, hygiene, and a diverse diet. In comparison to Cluster 4, Cluster 1 exhibited the highest three-year cognitive impairment risk (adjusted odds ratio (aOR) = 3.31; 95% confidence interval (CI) = 1.81-6.05), followed by Cluster 2 and Cluster 3 after adjustment for baseline MMSE, residence, sex, age, years of education, drinking, smoking, hypertension, diabetes, heart disease and stroke or cardiovascular diseases.
Conclusions: A data-driven approach can be instrumental in identifying individuals at high risk of cognitive impairment among cognitively normal elderly populations. Based on various sociodemographic features, these clusters can suggest individualised intervention plans.
Copyright © 2024 by the Journal of Global Health. All rights reserved.
Conflict of interest statement
Disclosure of interest: The authors completed the ICMJE Disclosure of Interest Form (available upon request from the corresponding author) and disclose no relevant interests.
Figures




Similar articles
-
A Risk Prediction Model Based on Machine Learning for Cognitive Impairment Among Chinese Community-Dwelling Elderly People With Normal Cognition: Development and Validation Study.J Med Internet Res. 2021 Feb 24;23(2):e20298. doi: 10.2196/20298. J Med Internet Res. 2021. PMID: 33625369 Free PMC article.
-
Association between tooth loss and cognitive impairment in community-dwelling older Japanese adults: a 4-year prospective cohort study from the Ohasama study.BMC Oral Health. 2018 Aug 20;18(1):142. doi: 10.1186/s12903-018-0602-7. BMC Oral Health. 2018. PMID: 30126407 Free PMC article.
-
Association of APOE ε4 genotype and lifestyle with cognitive function among Chinese adults aged 80 years and older: A cross-sectional study.PLoS Med. 2021 Jun 1;18(6):e1003597. doi: 10.1371/journal.pmed.1003597. eCollection 2021 Jun. PLoS Med. 2021. PMID: 34061824 Free PMC article.
-
Leisure activities, education, and cognitive impairment in Chinese older adults: a population-based longitudinal study.Int Psychogeriatr. 2017 May;29(5):727-739. doi: 10.1017/S1041610216001769. Epub 2017 Jan 9. Int Psychogeriatr. 2017. PMID: 28067190 Free PMC article.
-
Using Machine Learning to Predict Cognitive Decline in Older Adults From the Chinese Longitudinal Healthy Longevity Survey: Model Development and Validation Study.JMIR Aging. 2025 Apr 30;8:e67437. doi: 10.2196/67437. JMIR Aging. 2025. PMID: 40305830 Free PMC article.
Cited by
-
A systematic exposure-wide framework leveraging machine learning to identify multidomain exposure factors and their joint influence on cognitive function: Evidence from a neurological cohort.Alzheimers Dement. 2025 Feb;21(2):e14624. doi: 10.1002/alz.14624. Alzheimers Dement. 2025. PMID: 39998468 Free PMC article.
-
Association Between Lymphocyte-High Density Lipoprotein Ratio and Cognitive Impairment in Chinese Older Adults: A Population-Based Cross-Section Study.Am J Alzheimers Dis Other Demen. 2025 Jan-Dec;40:15333175251361748. doi: 10.1177/15333175251361748. Epub 2025 Jul 30. Am J Alzheimers Dis Other Demen. 2025. PMID: 40736399 Free PMC article.
References
-
- National Bureau of Statistics of the People’s Republic of China. Interpretation of the seventh national census. 2021. Available: http://www.stats.gov.cn/zt_18555/zdtjgz/zgrkpc/dqcrkpc/. Accessed: 26 November 2023.
MeSH terms
LinkOut - more resources
Full Text Sources
Medical