Study of Zero-Inflated Regression Models in a Large-Scale Population Survey of Sub-Health Status and Its Influencing Factors
- PMID: 29301596
- DOI: 10.24920/J1001-9294.2017.054
Abstract
Objective: Sub-health status has steadily gained attention from both medical professionals and the public. Treating the number of sub-health symptoms as count data rather than dichotomous data allows a more complete and accurate analysis of findings in the sub-healthy population. This study compares the goodness of fit of count outcome models to identify the optimal model for sub-health research.

Methods: The study sample was derived from a large-scale population survey on physiological and psychological constants conducted from 2007 to 2011 in 4 provinces and 2 autonomous regions in China. We constructed four count outcome models using SAS: the Poisson model, the negative binomial (NB) model, the zero-inflated Poisson (ZIP) model and the zero-inflated negative binomial (ZINB) model. The number of sub-health symptoms was the main outcome measure. The alpha dispersion parameter and the O test were used to identify over-dispersed data, and the Vuong test was used to evaluate excess zero counts. The goodness of fit of the regression models was determined by predictive probability curves and the statistics of the likelihood ratio test.

Results: Of all 78 307 respondents, 38.53% reported no sub-health symptoms. The mean number of sub-health symptoms was 2.98, with a standard deviation of 3.72. The O statistic of the over-dispersion test was 720.995 (P<0.001); the estimated alpha was 0.618 (95% CI: 0.600-0.636) in the comparison of the ZINB and ZIP models; and the Vuong test statistic Z was 45.487. These results indicated over-dispersion of the data and excess zero counts in this sub-health study. The ZINB model had the largest log likelihood (-167 519), the smallest Akaike information criterion (335 112) and the smallest Bayesian information criterion (335 455), indicating the best goodness of fit. For most counts, the predictive probabilities of the ZINB model fitted the observed counts best. The logit part of the ZINB model showed that age, sex, occupation, smoking, alcohol drinking, ethnicity and obesity were determinants of the presence of sub-health symptoms; the negative binomial part of the ZINB model showed that sex, occupation, smoking, alcohol drinking, ethnicity, marital status and obesity had significant effects on the severity of sub-health.

Conclusions: All goodness-of-fit tests and the predictive probability curves led to the same finding: the ZINB model was the optimal model for exploring the influencing factors of sub-health symptoms.
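
To make the two-part structure of the ZINB model and the information criteria more concrete, the following is a minimal Python sketch; it is not the authors' SAS analysis. The function names are hypothetical. The values mu = 4.0 and pi_zero = 0.25 are illustrative assumptions; only alpha = 0.618, the log likelihood, the sample size, the AIC and the BIC come from the abstract. The parameter count k = 37 is not reported in the abstract; it is inferred here from the reported AIC and the (rounded) log likelihood, and it happens to reproduce the reported BIC as well.

```python
import math


def nb_pmf(y, mu, alpha):
    """Negative binomial pmf with mean mu and variance mu + alpha * mu**2."""
    r = 1.0 / alpha  # size parameter; a larger alpha means more over-dispersion
    log_p = (
        math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
        + r * math.log(r / (r + mu))
        + y * math.log(mu / (r + mu))
    )
    return math.exp(log_p)


def zinb_pmf(y, mu, alpha, pi_zero):
    """Zero-inflated NB: a structural zero with probability pi_zero,
    otherwise a draw from the negative binomial count part."""
    count_part = (1.0 - pi_zero) * nb_pmf(y, mu, alpha)
    return pi_zero + count_part if y == 0 else count_part


def aic(log_lik, k):
    """Akaike information criterion for k estimated parameters."""
    return 2 * k - 2 * log_lik


def bic(log_lik, k, n):
    """Bayesian information criterion for k parameters and n observations."""
    return k * math.log(n) - 2 * log_lik


if __name__ == "__main__":
    # Illustrative (assumed) mu and pi_zero; alpha is the abstract's estimate.
    mu, alpha, pi_zero = 4.0, 0.618, 0.25
    print("P(Y=0), NB only:", nb_pmf(0, mu, alpha))
    print("P(Y=0), ZINB   :", zinb_pmf(0, mu, alpha, pi_zero))

    # Reported values: log likelihood -167 519, n = 78 307 respondents.
    # k = 37 is an inference from the reported AIC, not a stated figure.
    print("AIC:", round(aic(-167519, 37)))
    print("BIC:", round(bic(-167519, 37, 78307)))
```

The extra zero mass comes only from the logit part (pi_zero), while the count part governs how many symptoms appear among those not in the structural-zero group. This mirrors the abstract's split between determinants of the presence of symptoms and determinants of their severity.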