On outcome-dependent sampling designs for longitudinal binary response data with time-varying covariates
- PMID: 18372397
- PMCID: PMC2733177
- DOI: 10.1093/biostatistics/kxn006
On outcome-dependent sampling designs for longitudinal binary response data with time-varying covariates
Abstract
A typical longitudinal study prospectively collects both repeated measures of a health status outcome as well as covariates that are used either as the primary predictor of interest or as important adjustment factors. In many situations, all covariates are measured on the entire study cohort. However, in some scenarios the primary covariates are time dependent yet may be ascertained retrospectively after completion of the study. One common example would be covariate measurements based on stored biological specimens such as blood plasma. While authors have previously proposed generalizations of the standard case-control design in which the clustered outcome measurements are used to selectively ascertain covariates (Neuhaus and Jewell, 1990) and therefore provide resource efficient collection of information, these designs do not appear to be commonly used. One potential barrier to the use of longitudinal outcome-dependent sampling designs would be the lack of a flexible class of likelihood-based analysis methods. With the relatively recent development of flexible and practical methods such as generalized linear mixed models (Breslow and Clayton, 1993) and marginalized models for categorical longitudinal data (see Heagerty and Zeger, 2000, for an overview), the class of likelihood-based methods is now sufficiently well developed to capture the major forms of longitudinal correlation found in biomedical repeated measures data. Therefore, the goal of this manuscript is to promote the consideration of outcome-dependent longitudinal sampling designs and to both outline and evaluate the basic conditional likelihood analysis allowing for valid statistical inference.
Figures
References
-
- Anderson JA. Separate sample logistic discrimination. Biometrika. 1972;59:19–35.
-
- Azzalini A. Logistic regression for autocorrelated data with application to repeated measures. Biometrika. 1994;81:767–775.
-
- Breslow NE, Clayton DG. Approximate inference in generalized linear mixed models. Journal of the American Statistical Association. 1993;88:9–25.
-
- Diggle P, Heagerty PJ, Liang K-Y, Zeger SL. Analysis of Longitudinal Data. New York: Oxford University Press; 2002.
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
