Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data

Full text
Author(s):
Bertoli, Wesley [1] ; Conceicao, Katiane S. [2] ; Andrade, Marinho G. [2] ; Louzada, Francisco [2]
Total Authors: 4
Affiliation:
[1] Fed Univ Technol, Dept Stat, Av Sete Setembro, 3165 Reboucas, BR-80230901 Curitiba, Parana - Brazil
[2] Univ Sao Paulo, Inst Math & Comp Sci, Dept Appl Math & Stat, Av Trab Sao Carlense, 400 Parque Arnold Schimidt, BR-13566590 Sao Carlos, SP - Brazil
Total Affiliations: 2
Document type: Journal article
Source: Entropy; v. 23, n. 6 JUN 2021.
Web of Science Citations: 0
Abstract

Count datasets are traditionally analyzed using the ordinary Poisson distribution. However, said model has its applicability limited, as it can be somewhat restrictive to handling specific data structures. In this case, the need arises for obtaining alternative models that accommodate, for example, overdispersion and zero modification (inflation/deflation at the frequency of zeros). In practical terms, these are the most prevalent structures ruling the nature of discrete phenomena nowadays. Hence, this paper's primary goal was to jointly address these issues by deriving a fixed-effects regression model based on the hurdle version of the Poisson-Sujatha distribution. In this framework, the zero modification is incorporated by considering that a binary probability model determines which outcomes are zero-valued, and a zero-truncated process is responsible for generating positive observations. Posterior inferences for the model parameters were obtained from a fully Bayesian approach based on the g-prior method. Intensive Monte Carlo simulation studies were performed to assess the Bayesian estimators' empirical properties, and the obtained results have been discussed. The proposed model was considered for analyzing a real dataset, and its competitiveness regarding some well-established fixed-effects models for count data was evaluated. A sensitivity analysis to detect observations that may impact parameter estimates was performed based on standard divergence measures. The Bayesian p-value and the randomized quantile residuals were considered for the task of model validation. (AU)

FAPESP's process: 13/07375-0 - CeMEAI - Center for Mathematical Sciences Applied to Industry
Grantee:Francisco Louzada Neto
Support Opportunities: Research Grants - Research, Innovation and Dissemination Centers - RIDC
FAPESP's process: 19/21766-8 - Discrete data time series models
Grantee:Marinho Gomes de Andrade Filho
Support Opportunities: Scholarships abroad - Research
FAPESP's process: 19/22412-5 - Multiple modification points in discrete data models
Grantee:Katiane Silva Conceição
Support Opportunities: Scholarships abroad - Research