In a regression model with an explanatory variable (x) we have also the squared term of the variable x (sq_x) to capture possible nonlinearities in the relationship.
In order to check for endogeneity of x we use an instrumental variable (z) that is strongly correlated with the explanatory variable x. What kind of instrumental variable could be used for the squared term of x (sq_x) in the two stage least squares regression (TSLS/IV)?
I'm aware that in Wooldridge (2000), there is a nice discussion about this issue.
There are two approaches you can adopt. The first is to use the squares of the other exogenous variables as additional instruments. The second approach requires the use of (xhat)2 as instrument in the second-stage regression. Therefore, after obtaining (xhat) from first-stage regression and squaring this value, you must estimate:
ivreg2 y (x x2 = z (xhat)2 ) c
y-dependent
c- controls
Note that this approach adds a nonlinear function of your exogenous variables to your instrument set. The (xhat)2 should not be used as "regressor" in the second-stage model instead as an "instrument".
But I don't know how to implement this theory in Stata.
Thank you very much in advance for suggestions!
In order to check for endogeneity of x we use an instrumental variable (z) that is strongly correlated with the explanatory variable x. What kind of instrumental variable could be used for the squared term of x (sq_x) in the two stage least squares regression (TSLS/IV)?
I'm aware that in Wooldridge (2000), there is a nice discussion about this issue.
There are two approaches you can adopt. The first is to use the squares of the other exogenous variables as additional instruments. The second approach requires the use of (xhat)2 as instrument in the second-stage regression. Therefore, after obtaining (xhat) from first-stage regression and squaring this value, you must estimate:
ivreg2 y (x x2 = z (xhat)2 ) c
y-dependent
c- controls
Note that this approach adds a nonlinear function of your exogenous variables to your instrument set. The (xhat)2 should not be used as "regressor" in the second-stage model instead as an "instrument".
But I don't know how to implement this theory in Stata.
Thank you very much in advance for suggestions!
Comment