Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Oct;29(5):1751-1775.
doi: 10.3758/s13423-022-02098-w. Epub 2022 May 2.

Theoretical false positive psychology

Affiliations
Review

Theoretical false positive psychology

Brent M Wilson et al. Psychon Bull Rev. 2022 Oct.

Abstract

A fundamental goal of scientific research is to generate true positives (i.e., authentic discoveries). Statistically, a true positive is a significant finding for which the underlying effect size (δ) is greater than 0, whereas a false positive is a significant finding for which δ equals 0. However, the null hypothesis of no difference (δ = 0) may never be strictly true because innumerable nuisance factors can introduce small effects for theoretically uninteresting reasons. If δ never equals zero, then with sufficient power, every experiment would yield a significant result. Yet running studies with higher power by increasing sample size (N) is one of the most widely agreed upon reforms to increase replicability. Moreover, and perhaps not surprisingly, the idea that psychology should attach greater value to small effect sizes is gaining currency. Increasing N without limit makes sense for purely measurement-focused research, where the magnitude of δ itself is of interest, but it makes less sense for theory-focused research, where the truth status of the theory under investigation is of interest. Increasing power to enhance replicability will increase true positives at the level of the effect size (statistical true positives) while increasing false positives at the level of theory (theoretical false positives). With too much power, the cumulative foundation of psychological science would consist largely of nuisance effects masquerading as theoretically important discoveries. Positive predictive value at the level of theory is maximized by using an optimal N, one that is neither too small nor too large.

Keywords: False positives; Null hypothesis significance testing; Positive predictive value; Replication crisis.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Asendorpt, J. B., et al. (2013). Recommendations for increasing replicability in psychology. European Journal of Personality, 27, 108–119. - DOI
    1. Bakan, D. (1966). The test of significance in psychological research. Psychological Bulletin, 66, 423–437. - PubMed - DOI
    1. Baribault, B., Donkin, C., Little, D. R., Trueblood, J. S., Oravecz, Z., van Ravenzwaaij, D., White, C. N., De Boeck, P., & Vandekerckhove, J. (2018). Metastudies for robust tests of theory. Proceedings of the National Academy of Sciences of the United States of America, 115, 2607–2612. - PubMed - PMC - DOI
    1. Bishop, D. (2019). Rein in the four horsemen of irreproducibility. Nature, 568, 435. - PubMed - DOI
    1. Bolch, G., Greiner, S., de Meer, H., Trivedi, K. S. (1998). Queueing Networks and Markov Chains (Chapter 1, pp. 1-34). John Wiley & Sons.

LinkOut - more resources