Statistical models for longitudinal zero-inflated count data with applications to the substance abuse field (original) (raw)

This study fills in the current knowledge gaps in statistical analysis of longitudinal zero-inflated count data by providing a comprehensive review and comparison of the hurdle and zero-inflated Poisson models in terms of the conceptual framework, computational advantage, and performance under different real data situations. The design of simulations represents the special features of a wellknown longitudinal study of alcoholism so that the results can be generalizable to the substance abuse field. When the hurdle model is more natural under the conceptual framework of the data, the zeroinflated Poisson model tends to produce inaccurate estimates. Model performance improves with larger sample sizes, lower proportions of missing data, and lower correlations between covariates. The simulation also shows that the computational strength of the hurdle model disappears when random effects are included. Keywords hurdle model; zero-inflated Poisson model; random effect; regression spline have excess zero values beyond what would be expected by a classical Poisson model, especially when the sample is drawn from the general population or a community [3].