Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jul 27:9:56.
doi: 10.1186/1471-2288-9-56.

Bias in odds ratios by logistic regression modelling and sample size

Affiliations

Bias in odds ratios by logistic regression modelling and sample size

Szilard Nemes et al. BMC Med Res Methodol. .

Abstract

Background: In epidemiological studies researchers use logistic regression as an analytical tool to study the association of a binary outcome to a set of possible exposures.

Methods: Using a simulation study we illustrate how the analytically derived bias of odds ratios modelling in logistic regression varies as a function of the sample size.

Results: Logistic regression overestimates odds ratios in studies with small to moderate samples size. The small sample size induced bias is a systematic one, bias away from null. Regression coefficient estimates shifts away from zero, odds ratios from one.

Conclusion: If several small studies are pooled without consideration of the bias introduced by the inherent mathematical properties of the logistic regression model, researchers may be mislead to erroneous interpretation of the results.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Coefficient estimates and its sample size dependent systematic bias in logistic regression estimates. The deviance from the true population value (2 respectively -0.9 in this case) represents the analytically induced bias in regression estimates.
Figure 2
Figure 2
Sampling distribution of logistic regression coefficient estimates at different sample sizes.
Figure 3
Figure 3
Increasing sample size not only reduces the analytically induced bias in regression estimates but protects against extreme value estimates.

References

    1. Steineck G, Hunt H, Adolfsson J. A hierarchical step-model of bias – Evaluating cancer treatment with epidemiological Methods. Acta Oncologica. 2006;45:421–429. doi: 10.1080/02841860600649293. - DOI - PubMed
    1. Agresti A. Categorical Data Analysis. Wiley Series in Probability and Statistics, New Jersey, John Wiley & Sons Inc; 1990.
    1. Firth D. Bias reduction of maximum likelihood estimates. Biometrica. 1993;80(1):27–38. doi: 10.1093/biomet/80.1.27. - DOI
    1. Cox DR, Hinkley DV. Theoretical Statistics. Chapman and Hall, London; 1982.
    1. Jewell NP. Small-sample bias of point estimators of the odds ratio from matched sets. Biometrics. 1984;40:412–435. doi: 10.2307/2531395. - DOI - PubMed

Publication types