Ubiquitous bias and false discovery due to model misspecification in analysis of statistical interactions: The role of the outcome's distribution and metric properties
- PMID: 36201820
- PMCID: PMC10369499
- DOI: 10.1037/met0000532
Abstract
Studies of interaction effects are of great interest because they identify crucial interplay between predictors in explaining outcomes. Previous work has considered several potential sources of statistical bias and substantive misinterpretation in the study of interactions, but less attention has been devoted to the role of the outcome variable in such research. Here, we consider bias and false discovery associated with estimates of interaction parameters as a function of the distributional and metric properties of the outcome variable. We begin by illustrating that, for a variety of noncontinuously distributed outcomes (i.e., binary and count outcomes), attempts to recover interaction parameters with the linear model lead to catastrophic levels of bias and false discovery. Next, focusing on transformations of normally distributed variables (i.e., censoring and noninterval scaling), we show that linear models again produce spurious interaction effects. We provide explanations offering geometric and algebraic intuition as to why interactions are a challenge for these incorrectly specified models. In light of these findings, we make two specific recommendations. First, careful consideration of the outcome's distributional properties should be a standard component of interaction studies. Second, researchers should approach research focusing on interactions with heightened levels of scrutiny. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
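The binary-outcome case described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' simulation design: the sample size, seed, coefficient values, and function names are all assumptions chosen for illustration. It generates a binary outcome from a logistic model with no interaction term, then fits both a linear model and a logistic model that each include a product term.

```python
import numpy as np


def simulate(n=200_000, seed=0):
    """Draw a binary outcome from a logistic model with NO interaction term."""
    rng = np.random.default_rng(seed)
    x1 = rng.normal(size=n)
    x2 = rng.normal(size=n)
    # Illustrative (assumed) coefficients: intercept -2, both slopes 1.
    eta = -2.0 + 1.0 * x1 + 1.0 * x2
    y = rng.binomial(1, 1.0 / (1.0 + np.exp(-eta))).astype(float)
    # Design matrix: intercept, x1, x2, and the product term x1 * x2.
    X = np.column_stack([np.ones(n), x1, x2, x1 * x2])
    return X, y


def fit_linear(X, y):
    """Ordinary least squares (the misspecified linear model)."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta


def fit_logistic(X, y, n_iter=25):
    """Logistic regression by Newton-Raphson (the correctly specified model)."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        w = p * (1.0 - p)
        gradient = X.T @ (y - p)
        hessian = X.T @ (X * w[:, None])
        beta = beta + np.linalg.solve(hessian, gradient)
    return beta


X, y = simulate()
lpm_beta = fit_linear(X, y)
logit_beta = fit_logistic(X, y)

# The last coefficient in each vector is the estimated x1 * x2 interaction.
print("linear model interaction:  ", lpm_beta[3])
print("logistic model interaction:", logit_beta[3])
```

Under these assumptions, the linear model should report an interaction coefficient clearly away from zero even though none exists on the logit scale, while the logistic fit should return a product-term estimate close to zero. The nonzero linear estimate reflects the curvature of the logistic function rather than any interplay between the predictors.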