Quantitative methods for descriptive intersectional analysis with binary health outcomes
- PMID: 35118188
- PMCID: PMC8800141
- DOI: 10.1016/j.ssmph.2022.101032
Quantitative methods for descriptive intersectional analysis with binary health outcomes
Abstract
Intersectionality recognizes that in the context of sociohistorically shaped structural power relations, an individual's multiple social positions or identities (e.g., gender, ethnicity) can interact to affect health-related outcomes. Despite limited methodological guidance, intersectionality frameworks have increasingly been incorporated into epidemiological studies, both to describe health disparities and to examine their causes. This study aimed to advance methods in intersectional estimation of binary outcomes in descriptive health disparities research through evaluation of 7 potentially intersectional data analysis methods: cross-classification, regression with interactions, multilevel analysis of individual heterogeneity (MAIHDA), and decision trees (CART, CTree, CHAID, random forest). Accuracy of estimated intersection-specific outcome prevalence was evaluated across 192 intersections using simulated data scenarios. For comparison we included a non-intersectional main effects regression. We additionally assessed variable selection performance amongst decision trees. Example analyses using National Health and Nutrition Examination Study data illustrated differences in results between methods. At larger sample sizes, all methods except for CART performed better than non-intersectional main effects regression. In smaller samples, MAIHDA was the most accurate method but showed no advantage over main effects regression, while random forest, cross-classification, and saturated regression were the least accurate, and CTree and CHAID performed moderately well. CART performed poorly for estimation and variable selection. Sensitivity analyses examining the bias-variance tradeoff suggest MAIHDA as the preferred unbiased method for accurate estimation of high-dimensional intersections at smaller sample sizes. Larger sample sizes are more imperative for other methods. Results support the adoption of an intersectional approach to descriptive epidemiology.
Keywords: Biostatistics; CART, classification and regression tree; CHAID, chi-square automatic interaction detector; CTree, conditional inference trees; Epidemiological studies; Health equity; Intersectionality; MAD, mean absolute deviation; MAIHDA, multilevel analysis of individual heterogeneity and discriminatory accuracy; NHANES, National Health and Nutrition Examination Study; Research design; SD, standard deviation; U.S., United States; VIM, variable importance measure.
© 2022 The Authors.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures
References
-
- Bates D., Mächler M., Bolker B., Walker S. Fitting linear mixed-effects models using lme4. Journal of Statistical Software. 2015;67(1):1–48. doi: 10.18637/JSS.V067·I01. - DOI
LinkOut - more resources
Full Text Sources
Miscellaneous
