Type I and Type II error concerns in fMRI research: re-balancing the scale

Matthew D Lieberman¹, William A Cunningham

PMID: 20035017
PMCID: PMC2799956
DOI: 10.1093/scan/nsp052

Soc Cogn Affect Neurosci. 2009 Dec;4(4):423-8. Epub 2009 Dec 24.


Affiliation

¹ Department of Psychology, Franz Hall, University of California, Los Angeles, CA 90095-1563, USA. lieber@ucla.edu

Abstract

Statistical thresholding (i.e. P-values) in fMRI research has become increasingly conservative over the past decade in an attempt to diminish Type I errors (i.e. false alarms) to a level traditionally allowed in behavioral science research. In this article, we examine the unintended negative consequences of this single-minded devotion to Type I errors: increased Type II errors (i.e. missing true effects), a bias toward studying large rather than small effects, a bias toward observing sensory and motor processes rather than complex cognitive and affective processes and deficient meta-analyses. Power analyses indicate that the reductions in acceptable P-values over time are producing dramatic increases in the Type II error rate. Moreover, the push for a mapwide false discovery rate (FDR) of 0.05 is based on the assumption that this is the FDR in most behavioral research; however, this is an inaccurate assessment of the conventions in actual behavioral research. We report simulations demonstrating that combined intensity and cluster size thresholds such as P < 0.005 with a 10 voxel extent produce a desirable balance between Types I and II error rates. This joint threshold produces high but acceptable Type II error rates and produces a FDR that is comparable to the effective FDR in typical behavioral science articles (while a 20 voxel extent threshold produces an actual FDR of 0.05 with relatively common imaging parameters). We recommend a greater focus on replication and meta-analysis rather than emphasizing single studies as the unit of analysis for establishing scientific truth. From this perspective, Type I errors are self-erasing because they will not replicate, thus allowing for more lenient thresholding to avoid Type II errors.
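
The trade-off described in the abstract, where a stricter P-value threshold raises the Type II error rate, can be illustrated with a small power calculation. The following is a minimal sketch and not the authors' power analysis: it uses a one-sided one-sample z-test (normal approximation) for a single voxel, and the effect size (0.5) and sample size (20) are illustrative assumptions that do not come from the article.

```python
from statistics import NormalDist


def type2_error(alpha, effect_size, n):
    """Approximate Type II error rate (beta) for a one-sided one-sample z-test.

    alpha       -- per-test Type I error threshold (the P-value cutoff)
    effect_size -- standardized mean difference (assumed, illustrative)
    n           -- number of subjects (assumed, illustrative)
    """
    std_normal = NormalDist()
    # Critical value the test statistic must exceed to be declared significant.
    z_crit = std_normal.inv_cdf(1.0 - alpha)
    # Expected value of the test statistic when the effect is real.
    noncentrality = effect_size * n ** 0.5
    # Beta is the probability that the statistic falls below the critical value.
    return std_normal.cdf(z_crit - noncentrality)


if __name__ == "__main__":
    # Hypothetical thresholds, from lenient to strict.
    for alpha in (0.05, 0.005, 0.001, 0.0001):
        beta = type2_error(alpha, effect_size=0.5, n=20)
        print(f"alpha={alpha:<7} Type II error rate={beta:.2f}")
```

Under these assumed values, each tightening of the threshold increases the share of true effects that go undetected, which is the pattern the abstract attributes to increasingly conservative thresholding.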
