Dear Statalists,
I am working with a panel dataset in which I want to model territories' patenting activity as a function of some performance indicators. I have a balanced panel of 32 towns over 14 years and two outcomes of interest (SIPO and EPO). Summary statistics of the outcomes are reported below; I also attach kernel density plots of both variables.
Here are my questions:
1. The variable EPO looks clearly negative binomial, so I am almost sure it makes sense to use a panel negative binomial model (xtnbreg) to analyse it. Is the overdispersion large enough to support this choice?
2. I have more doubts about the SIPO variable. It also shows a Poisson-like distribution and still seems overdispersed given the magnitude of the standard deviation, but it contains no zeros. Given this, I am not sure whether I should consider a zero-truncated model. In my case, a value of zero could occur in principle but does not actually occur in my data. Furthermore, I could not find any Stata command that fits a zero-truncated negative binomial panel regression. Is it correct to stick with xtnbreg in this case as well? If a zero-truncated negative binomial should be chosen, is there a Stata command for a panel version?
3. I observed that the within variability of both outcomes is larger than the between variability. Apart from considering an RE version, does this fact have any bearing on the choice of model (negative binomial instead of Poisson)?
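To make the questions concrete, here is a minimal sketch of the candidate specifications being compared. The names town, year, x1 and x2 are hypothetical placeholders for the panel identifier, the time variable and the performance indicators; they are not the actual variable names in the dataset.

```stata
* Hypothetical names: town (panel id), year (time), x1 x2 (performance indicators)
xtset town year

* Question 1: random-effects negative binomial for EPO
xtnbreg EPO x1 x2, re

* Question 2: random-effects Poisson vs. negative binomial for SIPO
xtpoisson SIPO x1 x2, re
xtnbreg SIPO x1 x2, re
```

This is only an illustration of the commands named in the questions, not a recommendation of one specification over another.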
Thank you to anyone who will help!
Chiara
Var  | Mean     | St. Dev. | Min | Max
SIPO | 420.0725 | 612.1378 | 2   | 5670
EPO  | 2.162637 | 4.281392 | 0   | 35
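As a rough check on the overdispersion mentioned in questions 1 and 2, the variance-to-mean ratio of each outcome can be computed directly from the table above. This is only a sketch: it uses the raw (unconditional) moments, so it ignores the covariates and the panel structure, and a ratio above 1 is suggestive rather than a formal test.

```stata
* Variance-to-mean ratio from the summary statistics above.
* Under a Poisson distribution the variance equals the mean, so the ratio is 1.
display "SIPO variance/mean: " 612.1378^2 / 420.0725
display "EPO  variance/mean: " 4.281392^2 / 2.162637
```

Both ratios come out far above 1 (roughly 890 for SIPO and 8.5 for EPO), which is consistent with the impression that neither outcome looks equidispersed.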
xtsum SIPO EPO

Variable         |      Mean   Std. Dev.       Min        Max |    Observations
-----------------+--------------------------------------------+----------------
SIPO_P~l overall |  420.0725   612.1378          2       5670 |     N =     455
         between |             447.4541   65.71429   1963.714 |     n =      33
         within  |             461.8728  -1285.642   4436.644 | T-bar = 13.7879
                 |                                            |
EPO_Pa~l overall |  2.162637   4.281392          0         35 |     N =     455
         between |             2.335975          0   9.928571 |     n =      33
         within  |             3.607863  -7.765934   27.23407 | T-bar = 13.7879