Hello,
I have an unbalanced panel dataset with 187,805 observations resulting from healthcare surveys taken every two or so years over a 16 year time period from 15 different European countries.
I am doing research on two dependent variables:
Following are my independent variables:
Since doctor visits is a count variable, rightly skewed and shows overdispersion, I proceeded with the following negbin model:
However, I am unable to cluster when adding vce(cluster id) at individual level for both fixed and random effects. Stata says vcetype cluster not allowed.
Also, the fixed effects negbin model estimates the gender variable, which in fixed effects regression shouldn't happen. I checked, no one in the dataset changed their gender. So, it is time-invariant.
I read in this forum that negbin model isn't recommended most of the time. So, I tried to run a regress with xtpoisson. Clustering works for random effects, but not for fixed effects.
Now, coming to second dependent variable, I am using the following command for a fixed effects logistic regression. However,I am still stuck at clustering at individual level. This command works for random effects, but not for fixed effects. Stata says vcetype cluster not allowed.
My plan was to use clustering for the above individual models. Then do a Hausman test on the models without clusters; which by the way prefer fixed effects.
I would be really grateful for advice on the above clustering issue.
Also, how should I proceed further if a variable category is significant in both FE and RE, but has opposite signs?
Thank in advance.
I have an unbalanced panel dataset with 187,805 observations resulting from healthcare surveys taken every two or so years over a 16 year time period from 15 different European countries.
I am doing research on two dependent variables:
Code:
- doctor visits in the previous 12 months (0 to 98 visits)
- hospital stays in the previous 12 months (yes or no)
Code:
- age (65 and above years of age)
- gender (1 = female)
- education (low, medium, high)
- household income (6 categories)
- disease level (none, 1-2 diseases, 3-4 diseases, 5 or more diseases)
- depression level (scale ranging from 0 to 12 - lowest to highest depression)
- lagged physical activities - using values of previous survey (more than once a week, once a week, once a month, never)
Code:
xtnbreg doctor_visits age i.female i.education i.hhincome i.disease_level depression_level i.physical_activities
Also, the fixed effects negbin model estimates the gender variable, which in fixed effects regression shouldn't happen. I checked, no one in the dataset changed their gender. So, it is time-invariant.
I read in this forum that negbin model isn't recommended most of the time. So, I tried to run a regress with xtpoisson. Clustering works for random effects, but not for fixed effects.
Now, coming to second dependent variable, I am using the following command for a fixed effects logistic regression. However,I am still stuck at clustering at individual level. This command works for random effects, but not for fixed effects. Stata says vcetype cluster not allowed.
Code:
xtlogit hospitalised age i.female i.education i.hhincome i.disease_level depression_level i.physical_activities
I would be really grateful for advice on the above clustering issue.
Also, how should I proceed further if a variable category is significant in both FE and RE, but has opposite signs?
Thank in advance.
Comment