Hello Everyone,
I have a cross-sectional dataset and would like to estimate a Heckman sample selection model (heckprob: Probit model with sample selection) by using Stata 11. My data is hierarchical (households within villages and within regions). Therefore, I would like to include those variables, as random effects, for both the selection (probit) model and the second stage (probit) model.
Apparently, this can be done using gllamm command, though I did not figure out yet how to define the selection model.
My question is whether the following simple alternative could be valid:
1. Estimate the random effects (villages, regions) probit part of the model using xtprobit.
2. Calculate the inverse Mills ratio from the results, which equals
Invmills = normalden(linear_pred)/normal(linear_pred)
3. Include the Mills as an additionally explanatory variable in the second stage regression to control for selectivity bias by using either xtprobit or gllamm including again the random effects (villages, regions)
In a 2005 post from Statalist I have seen that this approach might not be adequate for panel data (measurements over time) with fixed effects. As my data is different I wonder: - Is this approach correct? - Is there any other better approach I could use? Many thanks in advance for your help!
Comment