Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec;29(6):1164-1179.
doi: 10.1037/met0000532. Epub 2022 Oct 6.

Ubiquitous bias and false discovery due to model misspecification in analysis of statistical interactions: The role of the outcome's distribution and metric properties

Affiliations

Ubiquitous bias and false discovery due to model misspecification in analysis of statistical interactions: The role of the outcome's distribution and metric properties

Benjamin W Domingue et al. Psychol Methods. 2024 Dec.

Abstract

Studies of interaction effects are of great interest because they identify crucial interplay between predictors in explaining outcomes. Previous work has considered several potential sources of statistical bias and substantive misinterpretation in the study of interactions, but less attention has been devoted to the role of the outcome variable in such research. Here, we consider bias and false discovery associated with estimates of interaction parameters as a function of the distributional and metric properties of the outcome variable. We begin by illustrating that, for a variety of noncontinuously distributed outcomes (i.e., binary and count outcomes), attempts to use the linear model for recovery leads to catastrophic levels of bias and false discovery. Next, focusing on transformations of normally distributed variables (i.e., censoring and noninterval scaling), we show that linear models again produce spurious interaction effects. We provide explanations offering geometric and algebraic intuition as to why interactions are a challenge for these incorrectly specified models. In light of these findings, we make two specific recommendations. First, a careful consideration of the outcome's distributional properties should be a standard component of interaction studies. Second, researchers should approach research focusing on interactions with heightened levels of scrutiny. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Scatterplot of E(y|x,z) and xz for different values of β0β1=β2=1,β3=0,N=1000 when y is a binary outcome.
Figure 2:
Figure 2:
Illustration of the geometry driving false discovery due to variation in β0 when the linear model is used β1=β2=1,β3=0,N=1000 for analysis of binary outcomes. Blue and red dots represent those data points within half a unit of their respective z values (i.e., z such that |z1|<0.5 are in blue and z such that |z+1|<0.5 are in red. Fitted lines for ±1 are similarly shaded). Solid lines are fits from logistic regression model while dashed lines are fits from linear model.
Figure 3:
Figure 3:
Variation in bias (left) and Type 1 error rate (right) associated with estimates of β3 when β3=0 for binary outcomes (for β0=0,β1=β2=1). Left: Estimates β3LM^ based on the LM for two sample sizes as a function of β0. Shaded regions capture span of estimates between 10th and 90th percentile while the solid line shows median estimates. Right: Levels of Type 1 error as a function of sample size for two values of β0.
Figure 4:
Figure 4:
Type 1 error rate as a function of sample size and prevalence when using LM to analyze binary outcomes. In all cases, β3=0 while β0 varies (such that stated prevalence is equivalent to eβ01+eβ0). The β1 and β2 coefficients are as shown for each panel. Blue regions indicate areas where Type 1 error is appropriate whereas red indicates regions wherein bias is leading to elevated levels of Type 1 error.
Figure 5:
Figure 5:
Scatterplot of E(y|x,z) and xz for different values of β2β0=1,β1=0.2,N=1000 when yi is a count outcome.
Figure 6:
Figure 6:
Illustration of the geometry driving false discovery due to variation in β2 when the linear model is used β0=1,β1=0.2,N=1000 for analysis of count outcomes. Blue and red dots represent those data points within half a unit of their respective z values (i.e., z such that |z1|<0.5 are in blue and z such that |z+1|<0.5 are in red. Fitted lines for ±1 are similarly shaded). Solid lines are fits from Poisson regression model while dashed lines are fits from linear model.
Figure 7:
Figure 7:
Variation in bias (left) and Type 1 error rate (right) associated with β3LM^ when β3=0 for count outcomes (for β0=0,β1=0.5). Left: Estimates of β3 based on the LM for two sample sizes as a function of β2. Shaded regions capture span of estimates between 10th and 90th percentile while the solid line shows median estimates. Right: Levels of Type 1 error as a function of sample size for two values of β2.
Figure 8:
Figure 8:
Scatterplot of y or y and xz for different values of cβ0=0,β1=β2=1,σy2=0.25,N=1000 when yi is a censored outcome.
Figure 9:
Figure 9:
Illustration of the geometry driving false discovery due to variation in c when the linear model is used β0=0,β1=β2=1,σy2=0.25,N=1000) for analysis of censored outcomes. Blue and red dots represent those data points within half a unit of their respective z values (i.e., z such that |z1|<0.5 are in blue and z such that |z+1|<0.5 are in red. Fitted lines for ±1 are similarly shaded). Solid lines are fits from Tobit regression model while dashed lines are fits from linear model.
Figure 10:
Figure 10:
Variation in bias (left) and Type 1 error rate (right) associated with β3LM^ when β3=0 for outcomes with a floor β0=0,β1=β2=1,σy2=1. Left: Estimates of β3 based on the LM for two sample sizes as a function of c. Shaded regions capture span of estimates between 10th and 90th percentile while the solid line shows median estimates. Right: Levels of Type 1 error as a function of sample size for two values of c.
Figure 11:
Figure 11:
Effect of transformation when α=0 for various values of λ.
Figure 12:
Figure 12:
Variation in bias (left) and Type 1 error rate (right) associated with β3LM^ when β3=0 for outcomes observed following Lord’s transformation β0=0,β1=β2=1,σy2=1,α=0. Left: Estimates of β3 based on the LM for two sample sizes as a function of λ. Shaded regions capture span of estimates between 10th and 90th percentile while the solid line shows median estimates. Right: Levels of Type 1 error as a function of sample size for two values of λ.

References

    1. Ancelin M-L, Scali J, Norton J, Ritchie K, Dupuy A-M, Chaudieu I, & Ryan J (2017). Heterogeneity in HPA axis dysregulation and serotonergic vulnerability to depression. Psychoneuroendocrinology, 77, 90–94. doi: 10.1016/j.psyneuen.2016.11.016 - DOI - PubMed
    1. Angosta J, Steers M-LN, Steers K, Riggs JL, & Neighbors C (2019). Who cares if college and drinking are synonymous? Identification with typical students moderates the relationship between college life alcohol salience and drinking outcomes. Addictive behaviors, 98, 106046. doi: 10.1016/j.addbeh.2019.106046 - DOI - PMC - PubMed
    1. Ballou D (2009). Test scaling and value-added measurement. Education finance and Policy, 4 (4), 351–383. doi: 10.1162/edfp.2009.4.4.351 - DOI
    1. Battey H, Cox D, & Jackson M (2019). On the linear in probability model for binary data. Royal Society open science, 6(5), 190067. doi: 10.1098/rsos.190067 - DOI - PMC - PubMed
    1. Briggs D, & Betebenner D (2009). Is growth in student achievement scale dependent. Unpublished manuscript. Retrieved from https://www.cde.state.co.us/sites/default/files/documents/research/downl...