Hi all,
I have a large dataset over 200,000 and I'm looking at count data so I'm considering poisson and negative binomial models. For postestimation model diagnostics I have read 'estat gof' in Stata manual 13 can be used but I am only able to get it to work with poisson and not negative binomial (it says invalid subcommand gof in Stata 13.1). I have lots of zeros in the data but I'm looking at Hospital admissions so there is no reason why a person would not enter hospital for the condition I'm looking at.
When I run the estat gof for a univariate poisson model I get the following, I've run several models with different dependent and independent variables where in other modelling I know relationships exist but get similar results with estat gof:
Deviance goodness of fit = 180000.1
Prob > chi2 (199,996) = 1.000
Pearson goodness of fit = 440000.2
Prob > chi2 (199,996) = 0.000
(numbers only approx., but p values are what is in the output)
The manual says both should be non-significant and goes on to model interaction and combines categories. I've also run with a two category independent variable and get similar results.
Has anyone else had an issue with this? Shall I ignore and just try other diagnostics? Do you suggest any in particular from experience on large live datasets?
many thanks,
Annette
I have a large dataset over 200,000 and I'm looking at count data so I'm considering poisson and negative binomial models. For postestimation model diagnostics I have read 'estat gof' in Stata manual 13 can be used but I am only able to get it to work with poisson and not negative binomial (it says invalid subcommand gof in Stata 13.1). I have lots of zeros in the data but I'm looking at Hospital admissions so there is no reason why a person would not enter hospital for the condition I'm looking at.
When I run the estat gof for a univariate poisson model I get the following, I've run several models with different dependent and independent variables where in other modelling I know relationships exist but get similar results with estat gof:
Deviance goodness of fit = 180000.1
Prob > chi2 (199,996) = 1.000
Pearson goodness of fit = 440000.2
Prob > chi2 (199,996) = 0.000
(numbers only approx., but p values are what is in the output)
The manual says both should be non-significant and goes on to model interaction and combines categories. I've also run with a two category independent variable and get similar results.
Has anyone else had an issue with this? Shall I ignore and just try other diagnostics? Do you suggest any in particular from experience on large live datasets?
many thanks,
Annette
Comment