Hi,
I have a cardinal dependent variable Y and a endogenous, binary, independent variable X and numerous control variables C.
If I run a standard OLS model, I get a significant, positive coefficient of 0.02 for X.
Now, I am implementing a 2-stage residual inclusion model where my first stage is a probit model X=(C, Z) with Z being my instrument.
Since I want to use the residuals from this stage I estimated the model as follows:
glm X C Z, fam(bin) link(probit)
predict Xhat, response
and then used the residuals in my original ols model:
reg Y X C Z Xhat
Now, I am getting a significant, negative coefficient for X of -4,25 and for the residual Xhat +4.49.
Can someone explain this drastic change of the coefficient?
Thanks a lot for your help.
I have a cardinal dependent variable Y and a endogenous, binary, independent variable X and numerous control variables C.
If I run a standard OLS model, I get a significant, positive coefficient of 0.02 for X.
Now, I am implementing a 2-stage residual inclusion model where my first stage is a probit model X=(C, Z) with Z being my instrument.
Since I want to use the residuals from this stage I estimated the model as follows:
glm X C Z, fam(bin) link(probit)
predict Xhat, response
and then used the residuals in my original ols model:
reg Y X C Z Xhat
Now, I am getting a significant, negative coefficient for X of -4,25 and for the residual Xhat +4.49.
Can someone explain this drastic change of the coefficient?
Thanks a lot for your help.
Comment