Dear Statalisters,
I have been trying to properly understand intuition and technicalities behind the so called “NB1” and “NB2” negative binomial regression models (Cameron and Trivedi, 1986 and 2013), which are implemented by Stata respectively with the nbreg, ´dispersion(constant)´ and ´nbreg, dispersion(mean)´ commands.
One hurdle I face is understanding the meaning of the terms “dispersion for the jth observation” and “model dispersion”, which are used interchangeably the Stata manual for nbreg (“Introduction to negative binomial regression”) and in this Stata FAQ page https://www.stata.com/support/faqs/s...ance-function/
More specifically, it seems to be the case that:
FOR MODEL NB2 (“mean dispersion”), the variance of event counts is Var(y_j) = mu_j * (1 + alpha * mu_j) and the so-called "model dispersion" or "dispersion for the jth observation" is (1 + alpha* mu_j)
FOR MODEL NB1 (“constant dispersion”), the variance of event counts is Var(y_j) = mu_j * (1 + delta) and the so-called "model dispersion" or "dispersion for the jth observation" is (1 + delta)
So while I understand how the variance of counts is derived, I do not understand what those "dispersion" terms exactly refer to. What is the "dispersion for the jth observation" or "model dispersion", and how is it derived? Cameron and Trivedi do not seem to talk about it at all, in their book.
It must be an important concept, since those terms are giving the name to the two model options in the Stata literature ("mean" and "constant" dispersion models). On the other hand, the variance of observed counts depends on the mean mu_j in BOTH models, so none of them seems to have "constant dispersion" in that sense.
Thank you very much in advance to those who will help!
Zelda
I have been trying to properly understand intuition and technicalities behind the so called “NB1” and “NB2” negative binomial regression models (Cameron and Trivedi, 1986 and 2013), which are implemented by Stata respectively with the nbreg, ´dispersion(constant)´ and ´nbreg, dispersion(mean)´ commands.
One hurdle I face is understanding the meaning of the terms “dispersion for the jth observation” and “model dispersion”, which are used interchangeably the Stata manual for nbreg (“Introduction to negative binomial regression”) and in this Stata FAQ page https://www.stata.com/support/faqs/s...ance-function/
More specifically, it seems to be the case that:
FOR MODEL NB2 (“mean dispersion”), the variance of event counts is Var(y_j) = mu_j * (1 + alpha * mu_j) and the so-called "model dispersion" or "dispersion for the jth observation" is (1 + alpha* mu_j)
FOR MODEL NB1 (“constant dispersion”), the variance of event counts is Var(y_j) = mu_j * (1 + delta) and the so-called "model dispersion" or "dispersion for the jth observation" is (1 + delta)
So while I understand how the variance of counts is derived, I do not understand what those "dispersion" terms exactly refer to. What is the "dispersion for the jth observation" or "model dispersion", and how is it derived? Cameron and Trivedi do not seem to talk about it at all, in their book.
It must be an important concept, since those terms are giving the name to the two model options in the Stata literature ("mean" and "constant" dispersion models). On the other hand, the variance of observed counts depends on the mean mu_j in BOTH models, so none of them seems to have "constant dispersion" in that sense.
Thank you very much in advance to those who will help!
Zelda

Comment