Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Clinical Trial
. 2011 Sep;37(5):376-82.
doi: 10.3109/00952990.2011.568080.

Some considerations for excess zeroes in substance abuse research

Affiliations
Clinical Trial

Some considerations for excess zeroes in substance abuse research

Dipankar Bandyopadhyay et al. Am J Drug Alcohol Abuse. 2011 Sep.

Abstract

Background: Count data collected in substance abuse research often come with an excess of "zeroes," which are typically handled using zero-inflated regression models. However, there is a need to consider the design aspects of those studies before using such a statistical model to ascertain the sources of zeroes.

Objectives: We sought to illustrate hurdle models as alternatives to zero-inflated models to validate a two-stage decision-making process in situations of "excess zeroes."

Methods: We use data from a study of 45 cocaine-dependent subjects where the primary scientific question was to evaluate whether study participation influences drug-seeking behavior. The outcome, "the frequency (count) of cocaine use days per week," is bounded (ranging from 0 to 7). We fit and compare binomial, Poisson, negative binomial, and the hurdle version of these models to study the effect of gender, age, time, and study participation on cocaine use.

Results: The hurdle binomial model provides the best fit. Gender and time are not predictive of use. Higher odds of use versus no use are associated with age; however once use is experienced, odds of further use decrease with increase in age. Participation was associated with higher odds of no-cocaine use; once there is use, participation reduced the odds of further use.

Conclusion: Age and study participation are significantly predictive of cocaine-use behavior.

Scientific significance: The two-stage decision process as modeled by a hurdle binomial model (appropriate for bounded count data with excess zeroes) provides interesting insights into the study of covariate effects on count responses of substance use, when all enrolled subjects are believed to be "at-risk" of use.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Plots of cocaine intake data. The left panel displays density histogram of the raw counts (frequency of cocaine intake). The right panel compares plots of “Observed-Expected predicted probabilities” for each of the counts for the 5 competing models.

References

    1. Alfò M, Maruotti A. Two-part regression models for longitudinal zero-inflated count data. The Canadian Journal of Statistics. 2010;38(2):197–216.
    1. Cameron AC, Trivedi PK. Regression analysis of count data. Cambridge: Cambridge University Press; 1998.
    1. Delucchi KL, Bostrom A. Methods for analysis of skewed data distributions in psychiatric clinical studies: working with many zero values. American Journal of Psychiatry. 2004;161:1159–1168. - PubMed
    1. Booth RE, Watters JK, Chitwood DD. HIV risk-related sex behaviors among injection drug users, crack smokers, and injection drug users who smoke crack. American Journal of Public Health. 1993;83:1144–1148. - PMC - PubMed
    1. Hien DA, Campbell ANC, Killen T, Hu M-C, Hansen C, Jiang H, Hatch-Mailette M, Miele GM, Cohen LR, Gan W, Resko SM, DiBono M, Wells EA, Nunes EV. The impact of trauma-focussed group theraphy upon HIV sexual risk behaviors in the NIDA clinical trials network “Women and Trauma” multi-site study. AIDS and Behavior. 2010;14(2):421–430. - PMC - PubMed

Publication types