Dear Stata users
I am having issues with bootstrapping standard errors after including regional fixed effected in my regressions. I know this has been commented on some previous posts and it seems to be a recurrent issue but I have not yet been able to solve my problem with the info posted there.
If I run my command without regional fixed effects as
Stata has no problem and all 1000 thousand replications are successful. However, if I run instead:
many replications fail. Using the noisily option delivers the usual message: collinearity in replicate sample is not the same as the full sample, posting missing values
insufficient observations to compute bootstrap standard errors no results will be saved
My bet is that when a new sample is drawn, observations for a particular region can be missing and hence cause the problem (given that some regions have a small number of observations, this is quite possible). My sample size is quite big and the number of regions is low, so that I am confident this is not a problem of insufficient degrees of freedom.
I have read about using the nodrop command, but this does not work. If what is happening is the problem I just mentioned (some regions not appearing in some replications), then I was thinking about specifying more than 1000 replications and keeping the first thousand successful ones. This may look a bit of a dirty practice, but to the extent that it is only due to some regions being minor in terms of observations in the dataset, it could be a solution. Does anyone know how to keep the first 1000 succesful replications our of (say) 3000?
Many thanks in advance for your help,
Basile
I am having issues with bootstrapping standard errors after including regional fixed effected in my regressions. I know this has been commented on some previous posts and it seems to be a recurrent issue but I have not yet been able to solve my problem with the info posted there.
If I run my command without regional fixed effects as
Code:
bootstrap, reps(1000) seed(8976): probit y x z, cluster(hh)
Code:
bootstrap, reps(1000) seed(8976): probit y x z i.region, cluster(hh)
insufficient observations to compute bootstrap standard errors no results will be saved
My bet is that when a new sample is drawn, observations for a particular region can be missing and hence cause the problem (given that some regions have a small number of observations, this is quite possible). My sample size is quite big and the number of regions is low, so that I am confident this is not a problem of insufficient degrees of freedom.
I have read about using the nodrop command, but this does not work. If what is happening is the problem I just mentioned (some regions not appearing in some replications), then I was thinking about specifying more than 1000 replications and keeping the first thousand successful ones. This may look a bit of a dirty practice, but to the extent that it is only due to some regions being minor in terms of observations in the dataset, it could be a solution. Does anyone know how to keep the first 1000 succesful replications our of (say) 3000?
Many thanks in advance for your help,
Basile

Comment