On the Use of Auxiliary Variables in Multilevel Regression and Poststratification
- PMID: 40476050
- PMCID: PMC12140408
- DOI: 10.1214/24-sts932
On the Use of Auxiliary Variables in Multilevel Regression and Poststratification
Abstract
Multilevel regression and poststratification (MRP) is a popular method for addressing selection bias in subgroup estimation, with broad applications across fields from social sciences to public health. In this paper, we examine the inferential validity of MRP in finite populations, exploring the impact of poststratification and model specification. The success of MRP relies heavily on the availability of auxiliary information that is strongly related to the outcome. To enhance the fitting performance of the outcome model, we recommend modeling the inclusion probabilities conditionally on auxiliary variables and incorporating flexible functions of estimated inclusion probabilities as predictors in the mean structure. We present a statistical data integration framework that offers robust inferences for probability and nonprobability surveys, addressing various challenges in practical applications. Our simulation studies indicate the statistical validity of MRP, which involves a tradeoff between bias and variance, with greater benefits for subgroup estimates with small sample sizes, compared to alternative methods. We have applied our methods to the Adolescent Brain Cognitive Development (ABCD) Study, which collected information on children across 21 geographic locations in the U.S. to provide national representation, but is subject to selection bias as a nonprobability sample. We focus on the cognition measure of diverse groups of children in the ABCD study and show that the use of auxiliary variables affects the findings on cognitive performance.
Keywords: data integration; model-based; nonprobability sample; robust inference; selection/nonresponse bias.
Figures




Similar articles
-
Technological aids for the rehabilitation of memory and executive functioning in children and adolescents with acquired brain injury.Cochrane Database Syst Rev. 2016 Jul 1;7(7):CD011020. doi: 10.1002/14651858.CD011020.pub2. Cochrane Database Syst Rev. 2016. PMID: 27364851 Free PMC article.
-
Education support services for improving school engagement and academic performance of children and adolescents with a chronic health condition.Cochrane Database Syst Rev. 2023 Feb 8;2(2):CD011538. doi: 10.1002/14651858.CD011538.pub2. Cochrane Database Syst Rev. 2023. PMID: 36752365 Free PMC article.
-
Psychological interventions for adults who have sexually offended or are at risk of offending.Cochrane Database Syst Rev. 2012 Dec 12;12(12):CD007507. doi: 10.1002/14651858.CD007507.pub2. Cochrane Database Syst Rev. 2012. PMID: 23235646 Free PMC article.
-
Magnetic resonance perfusion for differentiating low-grade from high-grade gliomas at first presentation.Cochrane Database Syst Rev. 2018 Jan 22;1(1):CD011551. doi: 10.1002/14651858.CD011551.pub2. Cochrane Database Syst Rev. 2018. PMID: 29357120 Free PMC article.
-
Factors that impact on the use of mechanical ventilation weaning protocols in critically ill adults and children: a qualitative evidence-synthesis.Cochrane Database Syst Rev. 2016 Oct 4;10(10):CD011812. doi: 10.1002/14651858.CD011812.pub2. Cochrane Database Syst Rev. 2016. PMID: 27699783 Free PMC article.
References
-
- Baker R, Brick JM, Bates NA, Battaglia M, Couper MP, Dever JA, Gile KJ, and Tourangeau R (2013). Summary report of the AAPOR Task Force on non-probability sampling. Journal of Survey Statistics and Methodology 1(2), 90–143.
-
- Bang H and Robins JM (2005). Doubly robust estimation in missing data and causal inference models. Biometrics 61, 962–972. - PubMed
-
- Bethlehem JG (2002). Weighting nonresponse adjustments based on auxiliary information. In Groves RM, Dillman DA, Eltinge JL, and Little RJA (Eds.), Survey Nonresponse. Wiley.
-
- Bradley R and Corwyn R (2002). Socioeconomic status and child development. Annu Rev Psychol 53, 371–399. - PubMed
-
- Breidt F and Opsomer J (2017). Model-assisted survey estimation with modern prediction techniques. Statistical Science 32, 190–205.
Grants and funding
LinkOut - more resources
Full Text Sources