Interpretable discriminant analysis for functional data supported on random nonlinear domains with an application to Alzheimer's disease

Eardi Lila¹, Wenbo Zhang^{1

2}, Swati Rane Levendovszky³; Alzheimer’s Disease Neuroimaging Initiative

Collaborators, Affiliations

PMID: 39279915
PMCID: PMC11398888
DOI: 10.1093/jrsssb/qkae023

Interpretable discriminant analysis for functional data supported on random nonlinear domains with an application to Alzheimer's disease

Eardi Lila et al. J R Stat Soc Series B Stat Methodol. 2024.

. 2024 Mar 22;86(4):1013-1044.

doi: 10.1093/jrsssb/qkae023. eCollection 2024 Sep.

PMID: 39279915
PMCID: PMC11398888
DOI: 10.1093/jrsssb/qkae023

Abstract

We introduce a novel framework for the classification of functional data supported on nonlinear, and possibly random, manifold domains. The motivating application is the identification of subjects with Alzheimer's disease from their cortical surface geometry and associated cortical thickness map. The proposed model is based upon a reformulation of the classification problem as a regularized multivariate functional linear regression model. This allows us to adopt a direct approach to the estimation of the most discriminant direction while controlling for its complexity with appropriate differential regularization. Our approach does not require prior estimation of the covariance structure of the functional predictors, which is computationally prohibitive in our application setting. We provide a theoretical analysis of the out-of-sample prediction error of the proposed model and explore the finite sample performance in a simulation setting. We apply the proposed method to a pooled dataset from Alzheimer's Disease Neuroimaging Initiative and Parkinson's Progression Markers Initiative. Through this application, we identify discriminant directions that capture both cortical geometric and thickness predictive features of Alzheimer's disease that are consistent with the existing neuroscience literature.

Keywords: functional classification; manifold data analysis; neuroimaging; shape data analysis.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest: None declared.

Figures

**Figure 1.**
Panel a: FoSs of three subjects in the training sample, where $g_{i} \in {‘ C ’, ‘ AD ’}$ denotes the disease state of the ith individual (C, Control; AD, Alzheimer’s Disease), $M_{i}$ is a two-dimensional manifold encoding the geometry of the cerebral cortex, and $z_{i} : M_{i} \to R$ is a real function, supported on $M_{i}$ , describing cortical thickness (in mm). Panel b: Linear representation $(v_{i}, x_{i})$ of each FoS $(M_{i}, z_{i})$ shown in Panel a. Here $v_{i} : R^{3} \to R^{3}$ is a vector-valued function encoding the geometry of the ith individual. This is depicted as a collection of 3D vectors ${v_{i} (p_{j})}$ for a dense set of points ${p_{j}} \subset R^{3}$ . For clarity, the function $v_{i}$ is displayed only on half of its domain $R^{3}$ . The function $x_{i} : M \to R$ describes the spatially normalized thickness map of the ith individual on the fixed template $M$ . Panel c: FoS $(φ_{v_{i}} (M), x_{i} \circ φ_{v_{i}}^{- 1})$ parametrized by the associated functions $(v_{i}, x_{i})$ in Panel b. This is a close approximation of the FoS $(M_{i}, z_{i})$ in Panel a.

**Figure 2.**
On the left-hand side, we show an element of the FE basis ${ψ_{l} : M_{T} \to R, l = 1, \dots, s}$ . This is a scalar affine function within each triangle of the mesh $M_{T}$ that takes value 1 on a fixed vertex and value 0 on every other vertex. On the right-hand side, we show an element of the basis ${\int_{R^{3}} K_{R^{3}} (p, \cdot) v_{i} (p) d p, i = 1, \dots, n}$ . This is a smooth vector-valued function from $R^{3}$ to $R^{3}$ .

**Figure 3.**
On the left-hand side, we show the most discriminant geometric and thickness directions as estimated from the linear representations ${(v_{i} - \bar{v}, x_{i} - \bar{x})}$ . These are a vector field ${\hat{β}}^{G} : R^{3} \to R^{3}$ , representing the most predictive geometric pattern of AD, and a function ${\hat{β}}^{F} : M \to R$ , representing the most predictive cortical thickness pattern of AD. For a new FoS, with linear representation $(v^{*}, x^{*})$ , we compute the score $⟨ v^{*} - \bar{v}, {\hat{β}}^{G} ⟩ + ⟨ x^{*} - \bar{x}, {\hat{β}}^{F} ⟩$ and predict whether the subject has AD by comparing the score value with a predetermined threshold $c^{th}$ . On the right-hand side, we depict the process of mapping back the estimates ${\hat{β}}^{G}$ and ${\hat{β}}^{F}$ to the space of FoSs. On the same space, we also pictorially map the classification rule adopted. In the ${\hat{β}}^{F}$ figure, the blue regions represent the areas of the cortical surface where a thinner cortex, relative to the population average, is indicative of AD. These are mostly localized in the lateral temporal, entorhinal, inferior parietal, precuneus, and posterior cingulate cortices. The red arrows in the ${\hat{β}}^{G}$ figure represent the regions where differences in the morphological configuration of the cerebral cortex, compared to the population average, are most predictive of AD. The specific types of morphological changes can be inspected by comparing the surfaces $φ_{\bar{v} - c_{1} {\hat{β}}^{G}} (M)$ and $φ_{\bar{v} + c_{1} {\hat{β}}^{G}} (M)$ , on the right hand side diagram.

**Figure 4.**
On the left side, we show the discriminant direction derived from applying a ridge logistic regression model to the thickness maps. In the centre, we show the discriminant direction resulting from fitting the proposed model in equation (10) to the thickness maps. Although it does not account for subject-specific geometric variations, this model enforces smoothness. On the right side, we have the cortical thickness discriminant direction obtained by fitting the model in equation (15), which explicitly accounts for inter-subject geometric differences. The results of the logistic regression are more difficult to interpret due to the high spatial variability. The model in equation (10) provides more interpretable results thanks to its smoothness penalty, but suggests that a *thicker* cortex in the red areas is indicative of AD, which is not physiologically plausible. When we explicitly model geometric differences, this evidence seems to disappear. This suggests that there is a non-negligible dependence structure between the predictors modelling geometry and those modelling thickness. Differences that seemed to be related to cortical thickness in the model without the geometric component are now captured by the term that models cortical geometric variations. Furthermore, when we model inter-subject geometric differences the entorhinal cortex atrophy in the medial temporal lobe is identified as the strongest predictor of AD. This is consistent with pathological findings and staging of early AD (Braak et al., 2006).

**Figure A1.**
Results of the simulation study to assess the performance of our proposed method, under the assumption of homogeneous covariances, for various sample sizes ( $n = 128, 256, 512, 1024$ ) and signal-to-noise ratios ( $α = 0.2, 0.4, 0.6$ ), where α reflects the strength of the discriminant signal. Prediction accuracy is measured using AUC and the simulations were repeated 50 times for each setting.

**Figure A2.**
Results of the simulation study for heterogeneous covariance structures across different sample sizes ( $n = 128, 256, 512, 1024$ ) and signal-to-noise ratios ( $α = 0.2, 0.4, 0.6$ ), where α reflects the strength of the discriminant signal. The prediction accuracy was evaluated through AUC and the simulations were repeated 50 times for each setting.

**Figure A3.**
Results of the simulation study to compare the performance of the different linear methods considered, using homogeneous covariances, for various sample sizes ( $n = 128, 256, 512, 1024$ ) and signal-to-noise ratios ( $α = 0.2, 0.4, 0.6$ ). Here, we measure the performance using the estimation error $‖ \hat{β} - β^{0} ‖_{L^{2} (M)}^{2}$ , with $\hat{β}$ an appropriately normalized version of the estimate of the true functional parameter $β^{0}$ .

See this image and copyright information in PMC

References

1. Arguillère S., Miller M. I., & Younes L. (2016). Diffeomorphic surface registration with atrophy constraints. SIAM Journal on Imaging Sciences, 9(3), 975–1003. 10.1137/15M104431X - DOI - PMC - PubMed
1. Berlinet A., & Thomas-Agnan C (2004). Reproducing kernel Hilbert spaces in probability and statistics. Springer US. 10.1007/978-1-4419-9096-9 - DOI
1. Berrendero J. R., Cuevas A., & Torrecilla J. L. (2018). On the use of reproducing kernel Hilbert spaces in functional classification. Journal of the American Statistical Association, 113(523), 1210–1218. 10.1080/01621459.2017.1320287 - DOI
1. Biffi C., De Marvao A., Attard M. I., Dawes T. J., Whiffin N., Bai W., Shi W., Francis C., Meyer H., Buchan R., Cook S. A., Rueckert D., & O’Regan D. P. (2018). Three-dimensional cardiovascular imaging-genetics: A mass univariate framework. Bioinformatics, 34(1), 97–103. 10.1093/bioinformatics/btx552 - DOI - PMC - PubMed
1. Blanchard, G., & Krämer, N. (2010). Optimal learning rates for kernel conjugate gradient regression. In Proceedings of the 23rd international conference on neural information processing systems – volume 1 (NIPS'10) (pp. 226–234). Curran Associates Inc., Red Hook, NY, USA. 10.5555/2997189.2997215 - DOI

Grants and funding

LinkOut - more resources

Full Text Sources
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Interpretable discriminant analysis for functional data supported on random nonlinear domains with an application to Alzheimer's disease

Interpretable discriminant analysis for functional data supported on random nonlinear domains with an application to Alzheimer's disease

Abstract

Conflict of interest statement

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources