Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2008 Sep 29;4(1):Article 19.
doi: 10.2202/1557-4679.1115.

Simple optimal weighting of cases and controls in case-control studies

Affiliations
Comparative Study

Simple optimal weighting of cases and controls in case-control studies

Sherri Rose et al. Int J Biostat. .

Abstract

Researchers of uncommon diseases are often interested in assessing potential risk factors. Given the low incidence of disease, these studies are frequently case-control in design. Such a design allows a sufficient number of cases to be obtained without extensive sampling and can increase efficiency; however, these case-control samples are then biased since the proportion of cases in the sample is not the same as the population of interest. Methods for analyzing case-control studies have focused on utilizing logistic regression models that provide conditional and not causal estimates of the odds ratio. This article will demonstrate the use of the prevalence probability and case-control weighted targeted maximum likelihood estimation (MLE), as described by van der Laan (2008), in order to obtain causal estimates of the parameters of interest (risk difference, relative risk, and odds ratio). It is meant to be used as a guide for researchers, with step-by-step directions to implement this methodology. We will also present simulation studies that show the improved efficiency of the case-control weighted targeted MLE compared to other techniques.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Predicted Probabilities for Misspecified *(A, W).
Figure 2:
Figure 2:
Simulation 1 – Bias Results. (Bias results for the case-control weighted targeted maximum likelihood with misspecified ĝ*(A | W) and the correctly specified case-control weighted targeted maximum likelihood were excluded since those values were the same as those for the targeted maximum likelihood with correctly specified *(A, W) and ĝ*(A | W).)
Figure 3:
Figure 3:
Simulation 2 – Bias Results for the Odds Ratio. (Bias results for the case-control weighted targeted maximum likelihood with misspecified ĝ*(A | W) were excluded since those values were the same as those for the targeted maximum likelihood with correctly specified *(A, W) and ĝ*(A | W).)

Similar articles

Cited by

References

    1. Anderson JA. Separate sample logistic discrimination. Biometrika. 1972;59:19–35.
    1. Bembom O, Peterson ML, Rhee S-Y, Fessel WJ, Sinisi SE, Shafer RW, van der Laan MJ. Technical Report 221, Division of Biostatistics. University of California; Berkeley: 2007. Biomarker discovery using targeted maximum likelihood estimation: Application to the treatment of antiretroviral resistant hiv infection. - PMC - PubMed
    1. Benichou J, Wacholder S. A comparison of three approaches to estimate exposure-specific incidence rates from population-based case-control data. Statistics in Medicine. 1994;13:651–661. - PubMed
    1. Breslow NE. Statistics in epidemiology: the case-control study. J Am Stat Soc. 1996;91:14–28. - PubMed
    1. Breslow NE, Day NE. Statistical Methods in Cancer Research: Volume 1 – The analysis of case-control studies. International Agency for Research on Cancer; Lyon: 1980. - PubMed

Publication types

LinkOut - more resources