Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018;45(9):1714-1733.
doi: 10.1080/02664763.2017.1391180. Epub 2017 Oct 24.

Untangle the Structural and Random Zeros in Statistical Modelings

Affiliations

Untangle the Structural and Random Zeros in Statistical Modelings

W Tang et al. J Appl Stat. 2018.

Abstract

Count data with structural zeros are common in public health applications. There are considerable researches focusing on zero-inflated models such as zero-inflated Poisson (ZIP) and zero-inflated Negative Binomial (ZINB) models for such zero-inflated count data when used as response variable. However, when such variables are used as predictors, the difference between structural and random zeros is often ignored and may result in biased estimates. One remedy is to include an indicator of the structural zero in the model as a predictor if observed. However, structural zeros are often not observed in practice, in which case no statistical method is available to address the bias issue. This paper is aimed to fill this methodological gap by developing parametric methods to model zero-inflated count data when used as predictors based on the maximum likelihood approach. The response variable can be any type of data including continuous, binary, count or even zero-inflated count responses. Simulation studies are performed to assess the numerical performance of this new approach when sample size is small to moderate. A real data example is also used to demonstrate the application of this method.

Keywords: generalized linear models; maximum likelihood; structural zeros; zero-inflated Poisson; zero-inflated explanatory variables.

PubMed Disclaimer

References

    1. BoÈhning D, Zero-inflated poisson models and ca man: A tutorial collection of evidence, Biometrical Journal 40 (1998), pp. 833–843.
    1. Buu A, Johnson N, Li R, and Tan X, New variable selection methods for zero-inflated count data with applications to the substance abuse field, Statistics in medicine 30 (2011), pp. 2326–2340. - PMC - PubMed
    1. Calsyn DA, Hatch-Maillette M, Tross S, Doyle SR, Crits-Christoph P, Song YS, Harrer JM, Lalos G, and Berns SB, Motivational and skills training hiv/sexually transmitted infection sexual risk reduction groups for men, Journal of Substance Abuse Treatment 37 (2009), pp. 138–150. - PMC - PubMed
    1. Connor J, Kypri K, Bell M, and Cousins K, Alcohol outlet density, levels of drinking and alcohol-related harm in new zealand: a national study, Journal of epidemiology and community health 65 (2011), pp. 841–846. - PubMed
    1. Cranford J, Zucker R, Jester J, Puttler L, and Fitzgerald H, Parental alcohol involvement and adolescent alcohol expectancies predict alcohol involvement in male adolescents., Psychology of Addictive Behaviors; Psychology of Addictive Behaviors 24 (2010), pp. 386–396. - PMC - PubMed