Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025:15155:24-34.
doi: 10.1007/978-3-031-74561-4_3. Epub 2024 Oct 18.

Spectral Graph Sample Weighting for Interpretable Sub-cohort Analysis in Predictive Models for Neuroimaging

Affiliations

Spectral Graph Sample Weighting for Interpretable Sub-cohort Analysis in Predictive Models for Neuroimaging

Magdalini Paschali et al. Predict Intell Med. 2025.

Abstract

Recent advancements in medicine have confirmed that brain disorders often comprise multiple subtypes of mechanisms, developmental trajectories, or severity levels. Such heterogeneity is often associated with demographic aspects (e.g., sex) or disease-related contributors (e.g., genetics). Thus, the predictive power of machine learning models used for symptom prediction varies across subjects based on such factors. To model this heterogeneity, one can assign each training sample a factor-dependent weight, which modulates the subject's contribution to the overall objective loss function. To this end, we propose to model the subject weights as a linear combination of the eigenbases of a spectral population graph that captures the similarity of factors across subjects. In doing so, the learned weights smoothly vary across the graph, highlighting sub-cohorts with high and low predictability. Our proposed sample weighting scheme is evaluated on two tasks. First, we predict initiation of heavy alcohol drinking in young adulthood from imaging and neuropsychological measures from the National Consortium on Alcohol and NeuroDevelopment in Adolescence (NCANDA). Next, we detect Dementia vs. Mild Cognitive Impairment (MCI) using imaging and demographic measurements in subjects from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Compared to existing sample weighting schemes, our sample weights improve interpretability and highlight sub-cohorts with distinct characteristics and varying model accuracy.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
We first create a population graph using pre-defined sociodemographic, genetic, or environmental factors that share associations with specific brain diseases. Next, a machine learning model is trained to predict symptom outcome given imaging and non-imaging data for each subject. During training, the classification loss is weighted by w, which is a linear combination of the graph eigenbases E with a learnable vector a. The learned weights highlight sub-cohorts that share common characteristics and achieve higher predictive power.
Fig. 2.
Fig. 2.
Comparison of learned weights across cohorts by graph factors for NCANDA and ADNI. The statistical difference of the weights across cohorts is measured with the Mann-Whitney U-test. **:p<0.001, *:p<0.05, ns:p>0.05. The BACC for each sub-cohort is shown under or above the box.
Fig. 3.
Fig. 3.
Impact of choice of neighbors and centering hyperparameter for NCANDA and ADNI. We measure the absolute difference in % of BACC between the cohorts with high vs. low weights to compare the ability of different models to create distinct and highly separable sub-cohorts based on the learned weights.

Similar articles

References

    1. Adeli E, Shi F, An L, Wee CY, Wu G, Wang T: Joint feature-sample selection and robust diagnosis of Parkinson’s disease from MRI data. NeuroImage 141, 206–219 (2016) - PMC - PubMed
    1. Belloy ME, et al.: Association of Klotho-VS heterozygosity with risk of Alzheimer disease in individuals who carry APOE4. JAMA Neurol. 77(7), 849–862 (2020) - PMC - PubMed
    1. Brown SA, et al.: The National Consortium on Alcohol and Neurodevelopment in Adolescence (NCANDA): a multisite study of adolescent development and substance use. J. Stud. Alcohol Drugs 76(6), 895–908 (2015) - PMC - PubMed
    1. Cho K, et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)
    1. Chung F: Spectral Graph Theory. American Mathematical Society, Providence, R.I (1997)

LinkOut - more resources