Some considerations for excess zeroes in substance abuse research
- PMID: 21854280
- PMCID: PMC3297079
- DOI: 10.3109/00952990.2011.568080
Some considerations for excess zeroes in substance abuse research
Abstract
Background: Count data collected in substance abuse research often come with an excess of "zeroes," which are typically handled using zero-inflated regression models. However, there is a need to consider the design aspects of those studies before using such a statistical model to ascertain the sources of zeroes.
Objectives: We sought to illustrate hurdle models as alternatives to zero-inflated models to validate a two-stage decision-making process in situations of "excess zeroes."
Methods: We use data from a study of 45 cocaine-dependent subjects where the primary scientific question was to evaluate whether study participation influences drug-seeking behavior. The outcome, "the frequency (count) of cocaine use days per week," is bounded (ranging from 0 to 7). We fit and compare binomial, Poisson, negative binomial, and the hurdle version of these models to study the effect of gender, age, time, and study participation on cocaine use.
Results: The hurdle binomial model provides the best fit. Gender and time are not predictive of use. Higher odds of use versus no use are associated with age; however once use is experienced, odds of further use decrease with increase in age. Participation was associated with higher odds of no-cocaine use; once there is use, participation reduced the odds of further use.
Conclusion: Age and study participation are significantly predictive of cocaine-use behavior.
Scientific significance: The two-stage decision process as modeled by a hurdle binomial model (appropriate for bounded count data with excess zeroes) provides interesting insights into the study of covariate effects on count responses of substance use, when all enrolled subjects are believed to be "at-risk" of use.
Figures
References
-
- Alfò M, Maruotti A. Two-part regression models for longitudinal zero-inflated count data. The Canadian Journal of Statistics. 2010;38(2):197–216.
-
- Cameron AC, Trivedi PK. Regression analysis of count data. Cambridge: Cambridge University Press; 1998.
-
- Delucchi KL, Bostrom A. Methods for analysis of skewed data distributions in psychiatric clinical studies: working with many zero values. American Journal of Psychiatry. 2004;161:1159–1168. - PubMed
-
- Hien DA, Campbell ANC, Killen T, Hu M-C, Hansen C, Jiang H, Hatch-Mailette M, Miele GM, Cohen LR, Gan W, Resko SM, DiBono M, Wells EA, Nunes EV. The impact of trauma-focussed group theraphy upon HIV sexual risk behaviors in the NIDA clinical trials network “Women and Trauma” multi-site study. AIDS and Behavior. 2010;14(2):421–430. - PMC - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical