Secondary outcome analysis for data from an outcome-dependent sampling design
- PMID: 29682775
- PMCID: PMC6130921
- DOI: 10.1002/sim.7672
Abstract
An outcome-dependent sampling (ODS) scheme is a cost-effective way to conduct a study. For a study with a continuous primary outcome, an ODS scheme can be implemented in which the expensive exposure is measured only on a simple random sample and on supplemental samples selected from the 2 tails of the primary outcome distribution. Given the substantial cost of collecting the primary exposure information, investigators often would like to use the available data to study the relationship between a secondary outcome and the measured exposure variable. This is referred to as secondary analysis. Secondary analysis in ODS designs requires care, because the ODS sample is not a random sample from the general population. In this article, we use inverse probability weighted and augmented inverse probability weighted estimating equations to analyze the secondary outcome for data obtained from the ODS design. We make no parametric assumptions about the primary and secondary outcomes and specify only the form of the regression mean models, thus allowing an arbitrary error distribution. Our approach is robust to misspecification of second- and higher-order moments. It also yields more precise parameter estimates by effectively using all the available participants. Through simulation studies, we show that the proposed estimator is consistent and asymptotically normal. Data from the Collaborative Perinatal Project are analyzed to illustrate our method.
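The inverse probability weighting idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a linear mean model for the secondary outcome, Bernoulli sampling with selection probabilities known by design, and tail cut points at the 10th and 90th percentiles of the primary outcome. The function names `simulate_ods` and `ipw_linear_fit`, and all parameter values, are hypothetical. The augmented (AIPW) estimator is not shown.

```python
import numpy as np


def simulate_ods(n=20000, p_srs=0.1, p_tail=0.5, rho=0.7, seed=0):
    """Simulate a cohort and an ODS sample (all settings are illustrative assumptions)."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=n)                      # expensive exposure
    e1 = rng.normal(size=n)                     # primary-outcome error
    e2 = rho * e1 + np.sqrt(1 - rho**2) * rng.normal(size=n)  # correlated error
    y1 = 1.0 + 0.5 * x + e1                     # primary outcome (drives sampling)
    y2 = 0.3 + 0.8 * x + e2                     # secondary outcome of interest

    # Tails of the primary outcome (assumed cut points: 10th and 90th percentiles).
    lo, hi = np.quantile(y1, [0.1, 0.9])
    in_tail = (y1 < lo) | (y1 > hi)

    # Simple random sample, then supplemental sampling from the tails.
    srs = rng.random(n) < p_srs
    supp = (~srs) & in_tail & (rng.random(n) < p_tail)
    selected = srs | supp

    # Selection probability implied by the design above.
    pi = p_srs + (1 - p_srs) * p_tail * in_tail
    return x, y2, selected, pi


def ipw_linear_fit(x, y, selected, pi):
    """Solve sum_i (R_i / pi_i) * X_i * (y_i - X_i' beta) = 0 for beta."""
    X = np.column_stack([np.ones(selected.sum()), x[selected]])
    w = 1.0 / pi[selected]                      # inverse probability weights
    A = X.T @ (w[:, None] * X)
    b = X.T @ (w * y[selected])
    return np.linalg.solve(A, b)


if __name__ == "__main__":
    x, y2, selected, pi = simulate_ods()
    beta = ipw_linear_fit(x, y2, selected, pi)
    print("sampled:", int(selected.sum()), "IPW estimate (intercept, slope):", beta)
```

Weighting each sampled participant by the inverse of the selection probability makes the estimating equation unbiased for the full-cohort equation, which is why the oversampled tails do not distort the fitted mean model in this sketch. An unweighted fit to the same sample would generally be biased whenever the errors of the two outcomes are correlated.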
Keywords: biased sampling; estimating equation; missing data; secondary analysis; semiparametric estimation; validation sample.
Copyright © 2018 John Wiley & Sons, Ltd.
Conflict of interest statement
None declared.
Similar articles
- A semiparametric empirical likelihood method for data from an outcome-dependent sampling scheme with a continuous outcome. Biometrics. 2002 Jun;58(2):413-21. doi: 10.1111/j.0006-341x.2002.00413.x. PMID: 12071415
- Statistical inferences for data from studies conducted with an aggregated multivariate outcome-dependent sample design. Stat Med. 2017 Mar 15;36(6):985-997. doi: 10.1002/sim.7195. Epub 2016 Dec 14. PMID: 27966260 Free PMC article.
- Statistical inference for the additive hazards model under outcome-dependent sampling. Can J Stat. 2015 Sep;43(3):436-453. doi: 10.1002/cjs.11257. PMID: 26379363 Free PMC article.
- Best linear inverse probability weighted estimation for two-phase designs and missing covariate regression. Stat Med. 2019 Jul 10;38(15):2783-2796. doi: 10.1002/sim.8141. Epub 2019 Mar 25. PMID: 30908669 Free PMC article.
- Recent progresses in outcome-dependent sampling with failure time data. Lifetime Data Anal. 2017 Jan;23(1):57-82. doi: 10.1007/s10985-015-9355-7. Epub 2016 Jan 13. PMID: 26759313 Free PMC article. Review.
Cited by
- Plasma proteomics reveals markers of metabolic stress in HIV infected children with severe acute malnutrition. Sci Rep. 2020 Jul 8;10(1):11235. doi: 10.1038/s41598-020-68143-7. PMID: 32641735 Free PMC article. Clinical Trial.