Dear experts,
I am working on a project that analyses the effect of a binary endogenous regressor in combination with other independent variables and their effect on a non-binary outcome variable (Y1). In order to control for endogeneity, I employ two different methods. In a first step, I use the instrumental variables approach to account for the endogenous nature of my dummy regressor. In a second step, I account for endogeneity by using a control function (CF) approach and an inverse mills type of control factor (generalized residuals (Gourieroux et.al. (1987)). However, I was unable to find a real "instruction" on the application of the CF within stata. Hence, I would like to ask whether my current setting correctly accounts for I) endogeneity and II) to test the robustness of my IV estimates. Here is what I do:
1) I calculate a probit model in which I regress variables which I think have an effect on the chance to "receive" the endogeneous binary variable.
-> probit Y2, X1, X2, X3, Xn
2) From that regression I calculate the inverse mills ratio to account for a possible selection bias using the following formula:
I am wondering if my approach justifies my assumtpion to account for endogeneity correctly and that my results are not (as) biased as they would be when using my OLS model with just the endogenous binary regressor and no controls for endogeneity (as I do in column 2). Second, I wonder if my results are somehow biased since I include both, the inverse mills ratio as well as the generalized residuals which are both based on the same probit regression.
Would you agree with my approach or do you find improvements for my argumentation?
Your advice is much appreciated.
Thank you and best regards,
Alex
I am working on a project that analyses the effect of a binary endogenous regressor in combination with other independent variables and their effect on a non-binary outcome variable (Y1). In order to control for endogeneity, I employ two different methods. In a first step, I use the instrumental variables approach to account for the endogenous nature of my dummy regressor. In a second step, I account for endogeneity by using a control function (CF) approach and an inverse mills type of control factor (generalized residuals (Gourieroux et.al. (1987)). However, I was unable to find a real "instruction" on the application of the CF within stata. Hence, I would like to ask whether my current setting correctly accounts for I) endogeneity and II) to test the robustness of my IV estimates. Here is what I do:
1) I calculate a probit model in which I regress variables which I think have an effect on the chance to "receive" the endogeneous binary variable.
-> probit Y2, X1, X2, X3, Xn
2) From that regression I calculate the inverse mills ratio to account for a possible selection bias using the following formula:
- predict Dummy, xb
gen Invmills = normalden(Dummy)/normal(Dummy)
- predict xb, xb
- gen Lambda = cond(Endog. binary regressor == 1,normalden(xb)/normal(xb), -normalden(xb)/(1-normal(xb)))
I am wondering if my approach justifies my assumtpion to account for endogeneity correctly and that my results are not (as) biased as they would be when using my OLS model with just the endogenous binary regressor and no controls for endogeneity (as I do in column 2). Second, I wonder if my results are somehow biased since I include both, the inverse mills ratio as well as the generalized residuals which are both based on the same probit regression.
Would you agree with my approach or do you find improvements for my argumentation?
Your advice is much appreciated.
Thank you and best regards,
Alex
Comment