On semiparametric efficient inference for two-stage outcome-dependent sampling with a continuous outcome
- PMID: 20107493
- PMCID: PMC2761000
- DOI: 10.1093/biomet/asn073
Abstract
Outcome-dependent sampling designs have been shown to be a cost-effective way to enhance study efficiency. We show that the outcome-dependent sampling design with a continuous outcome can be viewed as an extension of two-stage case-control designs to the continuous-outcome case. We further show that two-stage outcome-dependent sampling has a natural link with the missing-data and biased-sampling framework. Through the use of semiparametric inference and missing-data techniques, we show that a certain semiparametric maximum likelihood estimator is computationally convenient and achieves the semiparametric efficient information bound. We demonstrate this both theoretically and through simulation.
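To make the sampling idea concrete, the following is a minimal Python sketch of how a two-stage outcome-dependent sample might be drawn. It is an illustration under stated assumptions, not the design or estimator from the paper: the abstract does not specify the selection rule, so the sketch assumes a simple random sample supplemented by extra draws from the lower and upper tails of the outcome. The function name `two_stage_ods`, the cutoffs `lower` and `upper`, and the simulated data are all hypothetical.

```python
import random


def two_stage_ods(y, n_srs, n_tail, lower, upper, seed=0):
    """Return sorted indices chosen for second-stage measurement.

    y            : first-stage outcomes, observed for every subject
    n_srs        : size of the simple random sample component
    n_tail       : supplemental draws from each tail of y
    lower, upper : hypothetical cutoffs that define the two tails
    """
    rng = random.Random(seed)
    everyone = list(range(len(y)))

    # Component 1: simple random sample, ignoring the outcome.
    srs = set(rng.sample(everyone, n_srs))

    # Component 2: supplemental samples from the tails of y,
    # drawn from subjects not already selected.
    remaining = [i for i in everyone if i not in srs]
    low = [i for i in remaining if y[i] < lower]
    high = [i for i in remaining if y[i] > upper]
    extra_low = rng.sample(low, min(n_tail, len(low)))
    extra_high = rng.sample(high, min(n_tail, len(high)))

    return sorted(srs | set(extra_low) | set(extra_high))


# Simulated first stage: the outcome y is observed for all 2000 subjects.
# The exposure x stands in for an expensive covariate that would only be
# measured on the selected subjects.
data_rng = random.Random(1)
x = [data_rng.gauss(0, 1) for _ in range(2000)]
y = [0.5 * xi + data_rng.gauss(0, 1) for xi in x]

selected = two_stage_ods(y, n_srs=200, n_tail=50, lower=-1.0, upper=1.0)

# Share of subjects with an extreme outcome, in the cohort and in the sample.
tail_share_cohort = sum(abs(v) > 1.0 for v in y) / len(y)
tail_share_sample = sum(abs(y[i]) > 1.0 for i in selected) / len(selected)
```

Because selection depends on the outcome, subjects with extreme outcomes are over-represented among those selected, and exposure data are missing by design for everyone else. This is the sense in which the design connects to the missing-data and biased-sampling framework: an analysis of the selected subjects has to account for how they were chosen.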