Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Mar 15;36(6):985-997.
doi: 10.1002/sim.7195. Epub 2016 Dec 14.

Statistical inferences for data from studies conducted with an aggregated multivariate outcome-dependent sample design

Affiliations

Statistical inferences for data from studies conducted with an aggregated multivariate outcome-dependent sample design

Tsui-Shan Lu et al. Stat Med. .

Abstract

Outcome-dependent sampling (ODS) scheme is a cost-effective sampling scheme where one observes the exposure with a probability that depends on the outcome. The well-known such design is the case-control design for binary response, the case-cohort design for the failure time data, and the general ODS design for a continuous response. While substantial work has been carried out for the univariate response case, statistical inference and design for the ODS with multivariate cases remain under-developed. Motivated by the need in biological studies for taking the advantage of the available responses for subjects in a cluster, we propose a multivariate outcome-dependent sampling (multivariate-ODS) design that is based on a general selection of the continuous responses within a cluster. The proposed inference procedure for the multivariate-ODS design is semiparametric where all the underlying distributions of covariates are modeled nonparametrically using the empirical likelihood methods. We show that the proposed estimator is consistent and developed the asymptotically normality properties. Simulation studies show that the proposed estimator is more efficient than the estimator obtained using only the simple-random-sample portion of the multivariate-ODS or the estimator from a simple random sample with the same sample size. The multivariate-ODS design together with the proposed estimator provides an approach to further improve study efficiency for a given fixed study budget. We illustrate the proposed design and estimator with an analysis of association of polychlorinated biphenyl exposure to hearing loss in children born to the Collaborative Perinatal Study. Copyright © 2016 John Wiley & Sons, Ltd.

Keywords: continuous multivariate responses; correlated responses; empirical likelihood; outcome-dependent sampling; semiparametric.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Relative efficiency of θ̂P to θ̂S for β̂1 across the sampling fraction of the supplemental samples (n1/n + n2/n), under the bivariate model with n = 200 (Multivariate-ODS), α1 = 0.5, β1 = −0.5, α2 = −0.8, β2 = ln(2), σ1 = σ2 = 1, ρ = 0.5, and X1 = X2 ~ N(0, 1); U = 90% and L = 10%.

Similar articles

References

    1. Cornfield J. A method of estimating comparative rates from clinical data. Applications to cancer of the lung, breast, and cervix. Journal of the National Cancer Institute. 1951;11:1269–1275. - PubMed
    1. White H. Maximum likelihood estimation of misspecified models. Econometrica. 1982;50:1–25.
    1. Breslow NE, Cain KC. Logistic regression for two-stage case-control data. Biometrika. 1988;75:11–20.
    1. Wang X, Zhou H. A semiparametric empirical likelihood model for biased sampling schemes with auxiliary covariates. Biometrics. 2006;62:1149–1160. - PubMed
    1. Wang X, Wu Y, Zhou H. Outcome and auxiliary-dependent subsampling and its statistical inference. Journal of Biopharmaceutical Statistics. 2009;19:1132–1150. - PMC - PubMed

Publication types

LinkOut - more resources