Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Mar 2:6:57.
doi: 10.3389/fpubh.2018.00057. eCollection 2018.

Unconditional or Conditional Logistic Regression Model for Age-Matched Case-Control Data?

Affiliations

Unconditional or Conditional Logistic Regression Model for Age-Matched Case-Control Data?

Chia-Ling Kuo et al. Front Public Health. .

Abstract

Matching on demographic variables is commonly used in case-control studies to adjust for confounding at the design stage. There is a presumption that matched data need to be analyzed by matched methods. Conditional logistic regression has become a standard for matched case-control data to tackle the sparse data problem. The sparse data problem, however, may not be a concern for loose-matching data when the matching between cases and controls is not unique, and one case can be matched to other controls without substantially changing the association. Data matched on a few demographic variables are clearly loose-matching data, and we hypothesize that unconditional logistic regression is a proper method to perform. To address the hypothesis, we compare unconditional and conditional logistic regression models by precision in estimates and hypothesis testing using simulated matched case-control data. Our results support our hypothesis; however, the unconditional model is not as robust as the conditional model to the matching distortion that the matching process not only makes cases and controls similar for matching variables but also for the exposure status. When the study design involves other complex features or the computational burden is high, matching in loose-matching data can be ignored for negligible loss in testing and estimation if the distributions of matching variables are not extremely different between cases and controls.

Keywords: frequency matching; individual matching; loose matching; precision in estimates and tests; sparse data problem; width of 95% confidence interval.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Age distributions of cases (white) and controls (grey) in the population, where the age distributions of exposed and unexposed subjects are N(70, 102) and N(65, 102), respectively, and OR (agex10) denotes odds ratio associated with a 10-year increase in age.
Figure 2
Figure 2
Age distributions of cases (white) and controls (grey) in the population where the age distributions of exposed and unexposed subjects are N(70, 102) and N(50, 102) and OR (agex10) denotes odds ratio associated with a 10-year increase in age.

References

    1. Breslow NE, Day NE, Halvorsen KT, Prentice RL, Sabai C. Estimation of multiple relative risk functions in matched case-control studies. Am J Epidemiol (1978) 108(4):299–307. 10.1093/oxfordjournals.aje.a112623 - DOI - PubMed
    1. Costanza MC. Matching. Prev Med (1995) 24(5):425–33. 10.1006/pmed.1995.1069 - DOI - PubMed
    1. Kupper LL, Karon JM, Kleinbaum DG, Morgenstern H, Lewis DK. Matching in epidemiologic studies: validity and efficiency considerations. Biometrics (1981) 37(2):271–91. 10.2307/2530417 - DOI - PubMed
    1. Miettinen OS. Estimation of relative risk from individually matched series. Biometrics (1970) 26(1):75–86. 10.2307/2529046 - DOI - PubMed
    1. Wacholder S, Silverman DT, McLaughlin JK, Mandel JS. Selection of controls in case-control studies. II. Types of controls. Am J Epidemiol (1992) 135(9):1029–41. 10.1093/oxfordjournals.aje.a116397 - DOI - PubMed

LinkOut - more resources