Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comment
. 2015 Aug 1:116:248-54.
doi: 10.1016/j.neuroimage.2015.04.032. Epub 2015 Apr 25.

Cross-validation and hypothesis testing in neuroimaging: An irenic comment on the exchange between Friston and Lindquist et al

Affiliations
Comment

Cross-validation and hypothesis testing in neuroimaging: An irenic comment on the exchange between Friston and Lindquist et al

Philip T Reiss. Neuroimage. .

Abstract

The "ten ironic rules for statistical reviewers" presented by Friston (2012) prompted a rebuttal by Lindquist et al. (2013), which was followed by a rejoinder by Friston (2013). A key issue left unresolved in this discussion is the use of cross-validation to test the significance of predictive analyses. This note discusses the role that cross-validation-based and related hypothesis tests have come to play in modern data analyses, in neuroimaging and other fields. It is shown that such tests need not be suboptimal and can fill otherwise-unmet inferential needs.

Keywords: Brain decoding; Cross-validation; Likelihood ratio test; Neyman–Pearson Lemma; Null hypothesis; Permutation test.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(a) Estimate of β obtained by minimizing the Lasso criterion (7) with log(λ) = −6. The rows represent 21 pain-relevant regions, while the columns represents 23 time points. The first dashed line marks the end of the hot stimulus, while the second marks the time at which “How painful?” appeared on the screen. (b) Estimate based on log(λ) = −1.84. (c) Ten-fold cross-validated mean squared error, ± 1 standard error (SE), for a range of log(λ) values. The first dotted line indicates where CV is minimized (log(λ) = −1.84); the second indicates the largest log(λ) at which the CV score is within 1 SE of the minimum. The numbers along the top edge are counts of nonzero coefficient estimates, which decrease as λ increases. (d) Histogram of 999 permuted-data CV scores, with the real-data CV score indicated by the red line.
Figure 2
Figure 2
Power of the CV permutation test (black) compared with that of the benchmark F-test (green), for two cases with p = 2: a one-way ANOVA comparing two groups (above) and a normally distributed covariate (below).

Comment on

References

    1. Abramovich F, Ritov Y. Statistical Theory: A Concise Introduction. Boca Raton, FL: CRC Press; 2013.
    1. Akaike H. Second International Symposium on Information Theory. Akademiai Kiado; 1973. Information theory and an extension of the maximum likelihood principle; pp. 267–281.
    1. Arlot S, Celisse A. A survey of cross-validation procedures for model selection. Statistics Surveys. 2010;4:40–79.
    1. Boulesteix AL, Sauerbrei W. Added predictive value of high-throughput molecular data to clinical data and its validation. Briefings in Bioinformatics. 2011;12:215–229. - PubMed
    1. Breiman L. Statistical modeling: The two cultures (with discussion) Statistical Science. 2001;16:199–231.