Biometrics. 2023 Mar;79(1):332-343.
doi: 10.1111/biom.13571. Epub 2021 Oct 12.

Generalized case-control sampling under generalized linear models

Jacob M Maronge et al. Biometrics. 2023 Mar.

Abstract

A generalized case-control (GCC) study, like the standard case-control study, leverages outcome-dependent sampling (ODS) to extend to nonbinary responses. We develop a novel, unifying approach for analyzing GCC study data using the recently developed semiparametric extension of the generalized linear model (GLM), which is substantially more robust to model misspecification than existing approaches based on parametric GLMs. For valid estimation and inference, we use a conditional likelihood to account for the biased sampling design. We describe analysis procedures for estimation and inference for the semiparametric GLM under a conditional likelihood, and we discuss problems with estimation and inference under a conditional likelihood when the response distribution is misspecified. We demonstrate the flexibility of our approach over existing ones through extensive simulation studies, and we apply the methodology to an analysis of the Asset and Health Dynamics Among the Oldest Old study, which motivates our research. The proposed approach yields a simple yet versatile solution for handling ODS in a wide variety of possible response distributions and sampling schemes encountered in practice.
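The role of the conditional likelihood can be illustrated with a minimal sketch. It is not the authors' semiparametric GLM procedure; it assumes a fully parametric logistic model for a binary response and known selection probabilities for each response value. The names `sampled_pmf`, `logistic_pmf`, `conditional_loglik`, and `select_prob` are hypothetical and introduced only for this illustration. Under these assumptions, the response distribution among sampled subjects is the model distribution reweighted by the selection probabilities and renormalized.

```python
import numpy as np


def sampled_pmf(pmf, select_prob):
    """Response distribution among sampled units.

    pmf: array (n, K) of model probabilities f(y | x) over K response values.
    select_prob: array (K,) of assumed known selection probabilities pi(y).
    Returns pi(y) f(y | x) / sum_y' pi(y') f(y' | x), row by row.
    """
    weighted = np.asarray(pmf, dtype=float) * np.asarray(select_prob, dtype=float)
    return weighted / weighted.sum(axis=-1, keepdims=True)


def logistic_pmf(beta, X):
    """Columns are P(Y=0 | x) and P(Y=1 | x) under a logistic model (an assumption)."""
    p1 = 1.0 / (1.0 + np.exp(-(X @ beta)))
    return np.column_stack([1.0 - p1, p1])


def conditional_loglik(beta, X, y, select_prob):
    """Log-likelihood of the observed responses, conditional on being sampled."""
    cond = sampled_pmf(logistic_pmf(beta, X), select_prob)
    return np.log(cond[np.arange(len(y)), y]).sum()


if __name__ == "__main__":
    X = np.array([[1.0, 0.5], [1.0, -1.0]])   # intercept column plus one covariate
    beta = np.array([-2.0, 0.8])
    select_prob = np.array([0.1, 1.0])        # controls sampled at 10%, cases at 100%
    print(sampled_pmf(logistic_pmf(beta, X), select_prob))
    print(conditional_loglik(beta, X, np.array([1, 0]), select_prob))
```

In this binary special case the reweighting reduces to shifting the logistic intercept by log(pi(1)/pi(0)), which is the familiar standard case-control result. The same reweight-and-renormalize step applies to responses with more than two values, which is the setting the GCC design targets.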

Keywords: conditional likelihood; efficiency; generalized case-control studies; generalized linear models; outcome-dependent sampling.

Figures

FIGURE 1
Example of the resulting response distribution for varying amounts of overdispersion (25%, 50%, and 100%) compared to generating binomial responses with probability equal to μi. This figure appears in color in the electronic version of this article, and any mention of color refers to that version.
