Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Mar;76(1):257-269.
doi: 10.1111/biom.13123. Epub 2019 Sep 30.

Distance-based analysis of variance for brain connectivity

Affiliations

Distance-based analysis of variance for brain connectivity

Russell T Shinohara et al. Biometrics. 2020 Mar.

Abstract

The field of neuroimaging dedicated to mapping connections in the brain is increasingly being recognized as key for understanding neurodevelopment and pathology. Networks of these connections are quantitatively represented using complex structures, including matrices, functions, and graphs, which require specialized statistical techniques for estimation and inference about developmental and disorder-related changes. Unfortunately, classical statistical testing procedures are not well suited to high-dimensional testing problems. In the context of global or regional tests for differences in neuroimaging data, traditional analysis of variance (ANOVA) is not directly applicable without first summarizing the data into univariate or low-dimensional features, a process that might mask the salient features of high-dimensional distributions. In this work, we consider a general framework for two-sample testing of complex structures by studying generalized within-group and between-group variances based on distances between complex and potentially high-dimensional observations. We derive an asymptotic approximation to the null distribution of the ANOVA test statistic, and conduct simulation studies with scalar and graph outcomes to study finite sample properties of the test. Finally, we apply our test to our motivating study of structural connectivity in autism spectrum disorder.

Keywords: biostatistics; distance statistics; kernel ANOVA; neuroimaging.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Observed data for two selected subjects, consisting of volume-normalized counts of streamline connections between each pair of regions. Regions are represented spatially in sagittal, coronal, and axial views as red dots, and blue lines are connections. Darker and wider blue lines indicate stronger connections between regions
FIGURE 2
FIGURE 2
Mean networks (first and second rows) for subjects younger than the median age (12.2 years) compared to older subjects. Regions are represented as red dots, and connections are shown as blue lines. Thicker lines indicate stronger connections on average between regions, and the legend indicates the number of streamline connections estimated. In the third row, the differences between the maps are shown with blue lines indicating stronger connections in older subjects, and red lines indicating weaker connections in these groups
FIGURE 3
FIGURE 3
Q-Q plots comparing the proposed approximation to the empirical null distribution of the squared Euclidean (top) and absolute (bottom) distance-based ANOVA test statistic Qn in the scalar case. ANOVA, analysis of variance; Q-Q, quantile-quantile
FIGURE 4
FIGURE 4
Graph-outcome simulation design. On the left, the adjacency matrix for the simulated graphs is shown, and on the right three example subjects are shown with τ0 =5% for the first population and τ1 =10% for the second
FIGURE 5
FIGURE 5
Figures showing the type I error rates and power for various settings with network outcomes in scenario 4. The top row shows type I error rates for several noise levels and power under several alternatives for the case of π0 = 1/2, and the bottom rows show results for the case of imbalanced group sizes

References

    1. Anderson TW (1963) Asymptotic theory for principal component analysis. Annals of Mathematical Statistics, 34(1), 122–148.
    1. Behrens T, Woolrich M, Jenkinson M, Johansen-Berg H, Nunes R, Clare S et al. (2003) Characterization and propagation of uncertainty in diffusion-weighted mr imaging. Magnetic Resonance in Medicine, 50(5), 1077–1088. - PubMed
    1. Cuevas A, Febrero M and Fraiman R (2004) An anova test for functional data. Computational Statistics and Data Analysis, 47(1), 111–122.
    1. Fan J and Fan Y (2008) High dimensional classification using features annealed independence rules. Annals of Statistics, 36(6), 2605. - PMC - PubMed
    1. Fan J, Han F and Liu H (2014) Challenges of big data analysis. National Science Review, 1(2), 293–314. - PMC - PubMed

Publication types