Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Dec;72(6):558-569.
doi: 10.4097/kja.19087. Epub 2019 Jul 15.

Multicollinearity and misleading statistical results

Affiliations

Multicollinearity and misleading statistical results

Jong Hae Kim. Korean J Anesthesiol. 2019 Dec.

Abstract

Multicollinearity represents a high degree of linear intercorrelation between explanatory variables in a multiple regression model and leads to incorrect results of regression analyses. Diagnostic tools of multicollinearity include the variance inflation factor (VIF), condition index and condition number, and variance decomposition proportion (VDP). The multicollinearity can be expressed by the coefficient of determination (Rh2) of a multiple regression model with one explanatory variable (Xh) as the model's response variable and the others (Xi [i ≠ h]) as its explanatory variables. The variance (σh2) of the regression coefficients constituting the final regression model are proportional to the VIF. Hence, an increase in Rh2 (strong multicollinearity) increases σh2. The larger σh2 produces unreliable probability values and confidence intervals of the regression coefficients. The square root of the ratio of the maximum eigenvalue to each eigenvalue from the correlation matrix of standardized explanatory variables is referred to as the condition index. The condition number is the maximum condition index. Multicollinearity is present when the VIF is higher than 5 to 10 or the condition indices are higher than 10 to 30. However, they cannot indicate multicollinear explanatory variables. VDPs obtained from the eigenvectors can identify the multicollinear variables by showing the extent of the inflation of σh2 according to each condition index. When two or more VDPs, which correspond to a common condition index higher than 10 to 30, are higher than 0.8 to 0.9, their associated explanatory variables are multicollinear. Excluding multicollinear explanatory variables leads to statistically stable multiple regression models.

Keywords: Biomedical research; Biostatistics; Multivariable analysis; Regression; Statistical bias; Statistical data analysis.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest

No potential conflict of interest relevant to this article was reported.

Figures

Fig. 1.
Fig. 1.
The effects of the coefficient of determination (Rj 2 ) from a regression model [Xij = γ0 + l=1k γl Xil + ϵi  (i = 1, 2,  , n; l = 1, 2,  , k; l j)] on the variance of a regression coefficient of interest [Var(βj)]. The presence of multicollinearity (an increase in Rj2) inflates Var(βj). Xij: jth explanatory variable of a regression model [Yi = β0 + β1Xi1 + β2Xi2 + , … , + βkXik + εi (i = 1,2, … , n)].

Similar articles

Cited by

References

    1. Kim JH, Kim TK, In J, Lee DK, Lee S, Kang H. Assessment of risk of bias in quasi-randomized controlled trials and randomized controlled trials reported in the Korean Journal of Anesthesiology between 2010 and 2016. Korean J Anesthesiol. 2017;70:511–9. - PMC - PubMed
    1. Vatcheva KP, Lee M, McCormick JB, Rahbar MH. Multicollinearity in regression analyses conducted in epidemiologic studies. Epidemiology (Sunnyvale) 2016;6:227. - PMC - PubMed
    1. McDonald GC. Ridge regression. Wiley Interdiscip Rev Comput Stat. 2009;1:93–100.
    1. Byun SH, Yang HS, Kim JH. Liver graft hyperperfusion in the early postoperative period promotes hepatic regeneration 2 weeks after living donor liver transplantation: a prospective observational cohort study. Medicine (Baltimore) 2016;95:e5404. - PMC - PubMed
    1. Sato Y, Tsukada K, Hatakeyama K. Role of shear stress and immune responses in liver regeneration after a partial hepatectomy. Surg Today. 1999;29:1–9. - PubMed

LinkOut - more resources