Multiple outputation: inference for complex clustered data by averaging analyses from independent data
- PMID: 12926727
- DOI: 10.1111/1541-0420.00049
Multiple outputation: inference for complex clustered data by averaging analyses from independent data
Abstract
This article applies a simple method for settings where one has clustered data, but statistical methods are only available for independent data. We assume the statistical method provides us with a normally distributed estimate, theta, and an estimate of its variance sigma. We randomly select a data point from each cluster and apply our statistical method to this independent data. We repeat this multiple times, and use the average of the associated theta's as our estimate. An estimate of the variance is given by the average of the sigma2's minus the sample variance of the theta's. We call this procedure multiple outputation, as all "excess" data within each cluster is thrown out multiple times. Hoffman, Sen, and Weinberg (2001, Biometrika 88, 1121-1134) introduced this approach for generalized linear models when the cluster size is related to outcome. In this article, we demonstrate the broad applicability of the approach. Applications to angular data, p-values, vector parameters, Bayesian inference, genetics data, and random cluster sizes are discussed. In addition, asymptotic normality of estimates based on all possible outputations, as well as a finite number of outputations, is proven given weak conditions. Multiple outputation provides a simple and broadly applicable method for analyzing clustered data. It is especially suited to settings where methods for clustered data are impractical, but can also be applied generally as a quick and simple tool.
Similar articles
-
Marginal analyses of clustered data when cluster size is informative.Biometrics. 2003 Mar;59(1):36-42. doi: 10.1111/1541-0420.00005. Biometrics. 2003. PMID: 12762439
-
Within-Cluster Resampling for Analysis of Family Data: Ready for Prime-Time?Stat Interface. 2010 Apr 1;3(2):169-176. doi: 10.4310/sii.2010.v3.n2.a4. Stat Interface. 2010. PMID: 20664749 Free PMC article.
-
Part 1. Statistical Learning Methods for the Effects of Multiple Air Pollution Constituents.Res Rep Health Eff Inst. 2015 Jun;(183 Pt 1-2):5-50. Res Rep Health Eff Inst. 2015. PMID: 26333238
-
Power analyses for longitudinal trials and other clustered designs.Stat Med. 2004 Sep 30;23(18):2799-815. doi: 10.1002/sim.1869. Stat Med. 2004. PMID: 15344187 Review.
-
A comparison of methods for the analysis of binomial clustered outcomes in behavioral research.J Neurosci Methods. 2016 Dec 1;274:131-140. doi: 10.1016/j.jneumeth.2016.10.005. Epub 2016 Oct 14. J Neurosci Methods. 2016. PMID: 27751892 Review.
Cited by
-
Randomized Trials With Repeatedly Measured Outcomes: Handling Irregular and Potentially Informative Assessment Times.Epidemiol Rev. 2022 Dec 21;44(1):121-137. doi: 10.1093/epirev/mxac010. Epidemiol Rev. 2022. PMID: 36259969 Free PMC article. Review.
-
Inhaled nitric oxide in preterm infants: an individual-patient data meta-analysis of randomized trials.Pediatrics. 2011 Oct;128(4):729-39. doi: 10.1542/peds.2010-2725. Epub 2011 Sep 19. Pediatrics. 2011. PMID: 21930540 Free PMC article.
-
A model for repeated clustered data with informative cluster sizes.Stat Med. 2014 Feb 28;33(5):738-59. doi: 10.1002/sim.5988. Epub 2013 Sep 30. Stat Med. 2014. PMID: 24123049 Free PMC article.
-
Soluble markers of inflammation and coagulation but not T-cell activation predict non-AIDS-defining morbid events during suppressive antiretroviral treatment.J Infect Dis. 2014 Oct 15;210(8):1248-59. doi: 10.1093/infdis/jiu254. Epub 2014 May 1. J Infect Dis. 2014. PMID: 24795473 Free PMC article.
-
Randomized trial of azithromycin to eradicate Ureaplasma respiratory colonization in preterm infants: 2-year outcomes.Pediatr Res. 2022 Jan;91(1):178-187. doi: 10.1038/s41390-021-01437-2. Epub 2021 Mar 3. Pediatr Res. 2022. PMID: 33658655 Free PMC article. Clinical Trial.
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous