Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep;44(13):4738-4753.
doi: 10.1002/hbm.26413. Epub 2023 Jul 7.

Using predictive validity to compare associations between brain damage and behavior

Affiliations

Using predictive validity to compare associations between brain damage and behavior

John F Magnotti et al. Hum Brain Mapp. 2023 Sep.

Abstract

Lesion-behavior mapping (LBM) provides a statistical map of the association between voxel-wise brain damage and individual differences in behavior. To understand whether two behaviors are mediated by damage to distinct regions, researchers often compare LBM weight outputs by either the Overlap method or the Correlation method. However, these methods lack statistical criteria to determine whether two LBM are distinct versus the same and are disconnected from a major goal of LBMs: predicting behavior from brain damage. Without such criteria, researchers may draw conclusions from numeric differences between LBMs that are irrelevant to predicting behavior. We developed and validated a predictive validity comparison method (PVC) that establishes a statistical criterion for comparing two LBMs using predictive accuracy: two LBMs are distinct if and only if they provide unique predictive power for the behaviors being assessed. We applied PVC to two lesion-behavior stroke data sets, demonstrating its utility for determining when behaviors arise from the same versus different lesion patterns. Using region-of-interest-based simulations derived from proportion damage from a large data set (n = 131), PVC accurately detected when behaviors were mediated by different regions (high sensitivity) versus the same region (high specificity). Both the Overlap method and Correlation method performed poorly on the simulated data. By objectively determining whether two behavioral deficits can be explained by single versus distinct patterns of brain damage, PVC provides a critical advance in establishing the brain bases of behavior. We have developed and released a GUI-driven web app to encourage widespread adoption.

Keywords: lesion-behavior mapping; multivariate analysis; neuropsychology; stroke.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest/competing interests.

Figures

FIGURE 1
FIGURE 1
Model building phase in the predictive validity comparison (PVC) method. (a) Normalized, segmented lesion volumes for all participants (S1 to S n; purple denotes lesioned voxels). (b) Center and scale (z‐scored) behavioral scores (B1, B2) prior to fitting for S1 to S n (B1Z 1, blue; B2Z 2, orange). (c) Under the null hypothesis (H0), a single LBM is fit using a single behavior score, Z 0 (z‐scored average of Z 1 and Z 2, left) and subject's lesion volume (middle), producing a coefficient for each voxel (scaled from 0 to 1 for display purposes, transparent white to opaque red; right). (d) Under the alternative hypothesis (HA), distinct LBMs are fit using participants' lesion volumes paired with each scaled behavior Z 1 (LBM1) and Z2 (LBM2).
FIGURE 2
FIGURE 2
Model comparison in the predictive validity comparison (PVC) method. (a) Participants' lesion volumes are multiplied by each fitted lesion‐behavior map (LBM) to generate predicted values. (b) Under the null hypothesis (H0) there is a single LBM, producing a single prediction (Z^0) for the two actual scaled behaviors (Z1,Z2). (c) Predictions are compared with the actual scaled behaviors using AIC (solid line indicates equality between predicted and actual values). The total AIC (inset solid bar) for H0 calculated. (d) Under the alternative hypothesis (HA), there are distinct LBMs (LBM1, LBM2), producing distinct predictions (Z^1, Z^2) for the two actual scaled behaviors. (e) Predictions are compared with actual behaviors and total AIC for HA calculated (inset). (g) AIC difference (H0 − HA) determines the winning hypothesis: positive values favor HA; negative values favor H0. The gray dashed line at +10:1 is the cutoff for claiming decisive support for HA.
FIGURE 3
FIGURE 3
Comparing lesion‐behavior maps (LBMs) built from the MRRI data. (a) Scatter plot showing the strong linear relationship (r = 0.89) between the Western Aphasia Battery Aphasia Quotient (AQ) and the Philadelphia naming task (% accuracy) for 130 participants. (b) Fitted voxel weights (max‐scaled across maps; transparent white to opaque red) from LBMs fitted under the null hypothesis (LBM0) and alternative hypothesis (LBM1 and LBM2). The axial slice shown (z = 107) had the most nonzero voxel weights for both LBM1 and LBM2. LH: Left Hemisphere. (c)/(d) The PVC method compares the actual behavior to predictions generated under the null hypothesis (H0) and the alternative hypotheses (HA). Solid diagonal line indicates perfect prediction. (e) The AIC difference was decisive for H0 (cutoff at −10, gray dashed line). (f) The Overlap method highlights slices that differ between the LBMs. Voxel color indicates the sign and magnitude of the difference (opaque blue to transparent white to opaque red; max‐scaled). The dice coefficient of 0.77 suggests a moderate to high overlap between the maps. (g) The Correlation method suggested a moderate relationship (r = 0.64) between the weights in LBM1 and LBM2.
FIGURE 4
FIGURE 4
Comparing lesion‐behavior maps (LBMs) built from the Schnur Laboratory data. (a) Scatter plot showing no linear relationship between z‐scored words per min (speech rate) and z‐scored relative percentage of pronouns to nouns produced for 52 participants. (b) Fitted voxel weights (max‐scaled across maps; transparent white to opaque red) from LBMs fitted under the null hypothesis (LBM0) and alternative hypothesis (LBM1 and LBM2). The axial slices shown (z = 84, 113) had the most nonzero voxel weights for LBM1 and LBM2, respectively. LH: Left Hemisphere. Yellow arrows note areas of distinction across the slices. (c)/(d) The PVC method compares the actual behavior to predictions generated under the null hypothesis (H0) and the alternative hypotheses (HA). Solid diagonal line indicates perfect prediction. (e) The AIC difference (H0 − HA = 1143) used by the PVC method was decisive for HA (cutoff at +10, gray dashed line). (f) The Overlap method highlights slices that differ between the LBMs. Voxel color indicates the sign and magnitude of the difference (opaque blue to transparent white to opaque red; max‐scaled). The Dice coefficient of 0.52 suggests a moderate overlap between the maps. (g) The Correlation method assesses the relationship between the weights in LBM1 and LBM2, suggesting a weak but significant positive correlation (r = 0.11, p = 10−16).
FIGURE 5
FIGURE 5
Effect of varying SCCAN sparseness parameter and directionality of voxel weights on lesion‐behavior map (LBM) comparison results using real data. More sparse models have fewer voxels in the resulting LBM; number of nonzero voxels in the fitted LBMs are plotted on the horizontal axis (sparsity) and voxel‐weight directionality is shown in separate lines (pink: directional weights allowed; black: non‐negative weights only). The dashed gray line provides the threshold for determining same versus different LBMs. (a) For the MRRI dataset (H0), changing the sparsity of the result had quantitative impacts on the AIC difference used by the predictive validity comparison (PVC) method (Left), but no change in the conclusion (all simulations decisively supported the null hypothesis). For the Dice coefficient used by the Overlap method (second column) and the correlation‐based methods (third and fourth columns), the sparseness of the model modulated the quantitative metrics, but there was little impact of voxel‐weight directionality on the results. Across much of the range, all methods supported the same conclusion (maps are the same), except for sparse maps using the ROI correlation method. (b) For the Schnur Lab dataset (HA), the PVC method again showed quantitative differences across parameter manipulations, but no decisional change (all simulations favored the alternative hypothesis). The Dice coefficient and the ROI‐level correlation were inconsistent and strongly dependent on the choice of sparseness and voxel directionality; the voxel‐wise correlation showed consistent support for H0.
FIGURE 6
FIGURE 6
Simulation results. (a) Averaged accuracy from the different‐region simulations provide the sensitivity (% correct for judging that two behaviors have different neural bases) for each method (PVC: Blue; Voxel‐wise correlation: pale orange; ROI‐level correlation: orange; Overlap method/Dice coefficient: green), binned by the between‐region damage correlation. Each point represents the average sensitivity across a varying number of simulations. (b) Relative sensitivity versus PVC for each method (same colors as panel a). Only bins with >50% sensitivity are displayed. The dashed line at 0 represents the sensitivity of the PVC method (blue line in panel a). (c) The number of simulations within each bin (total simulation count = 765). This distribution was determined by the lesion distribution in the MRRI dataset.

References

    1. Alyahya, R. S. W. , Halai, A. D. , Conroy, P. , & Ralph, M. A. L. (2021). Content word production during discourse in aphasia: Deficits in word quantity, not lexical–semantic complexity. Journal of Cognitive Neuroscience, 33(12), 2494–2511. 10.1162/jocn_a_01772 - DOI - PMC - PubMed
    1. Arbula, S. , Ambrosini, E. , della Puppa, A. , de Pellegrin, S. , Anglani, M. , Denaro, L. , Piccione, F. , D'Avella, D. , Semenza, C. , Corbetta, M. , & Vallesi, A. (2020). Focal left prefrontal lesions and cognitive impairment: A multivariate lesion‐symptom mapping approach. Neuropsychologia, 136, 107253. 10.1016/j.neuropsychologia.2019.107253 - DOI - PubMed
    1. Avants, B. B. (2015). Advanced Normalization Tools for R. ⟨http://stnava.github.io/ANTsR⟩ http://stnava.github.io/ANTsR⟩
    1. Avants, B. B. , Epstein, C. L. , Grossman, M. , & Gee, J. C. (2008). Symmetric diffeomorphic image registration with cross‐correlation: Evaluating automated labeling of elderly and neurodegenerative brain. Medical Image Analysis, 12, 26–41. - PMC - PubMed
    1. Baldo, J. V. , Wilson, S. M. , & Dronkers, N. F. (2012). Uncovering the neural substrates of language: A voxel‐based lesion‐symptom mapping approach. In The handbook of the neuropsychology of language (pp. 582–594). John Wiley & Sons. 10.1002/9781118432501.ch28 - DOI

Publication types