Hi all,
Apologies if I am double-posting, but I have struggled to find clear guidance or a post directly related to the below IV related questions - if someone could point me in the right direction I would greatly appreciate it.
I am implementing an IV estimation using the control function method, with different outcome variables Y. Some models are poisson, others OLS, but the endogenous regressor and the instrument are the same. I have always thought that
- in all IV estimations, the first and second stage need to include the same control variables
- if the second stage includes an interaction term between the endogenous regressor and some other variable, I need to have two first stages: one on the endogenous regressor itself, the other on the interaction term, where the instrument is interacted with the same variable.
I would like to confirm if the above are necessarily always correct. When using the control function method, I am including the residual from the first stage in the second stage, so would this not control for the endogenous part of the endogenous regressor as well as its interacted version? I understand that each additional endogenous regressor requires an additional instrument, but is an interaction of the same variable really an additional endogenous regressor? Moreover, if the control variables used in the second stage are not required for the validity of the instrument, do they need to be in the first stage?
So, if the second stage is
Y = b0 + b1 X1 + b2 X1*X2 + b3 X2 + b4 X3 + u + e
where u is the residual from a regression
X1 = a0 + a1 Z + u
would I need to also run
X1*X2 = a0 + a1 Z*X2 + u_int
and include u_int in the second stage? and would the first stage regressions have to necessarily include X2 and X3?
Apologies if I am double-posting, but I have struggled to find clear guidance or a post directly related to the below IV related questions - if someone could point me in the right direction I would greatly appreciate it.
I am implementing an IV estimation using the control function method, with different outcome variables Y. Some models are poisson, others OLS, but the endogenous regressor and the instrument are the same. I have always thought that
- in all IV estimations, the first and second stage need to include the same control variables
- if the second stage includes an interaction term between the endogenous regressor and some other variable, I need to have two first stages: one on the endogenous regressor itself, the other on the interaction term, where the instrument is interacted with the same variable.
I would like to confirm if the above are necessarily always correct. When using the control function method, I am including the residual from the first stage in the second stage, so would this not control for the endogenous part of the endogenous regressor as well as its interacted version? I understand that each additional endogenous regressor requires an additional instrument, but is an interaction of the same variable really an additional endogenous regressor? Moreover, if the control variables used in the second stage are not required for the validity of the instrument, do they need to be in the first stage?
So, if the second stage is
Y = b0 + b1 X1 + b2 X1*X2 + b3 X2 + b4 X3 + u + e
where u is the residual from a regression
X1 = a0 + a1 Z + u
would I need to also run
X1*X2 = a0 + a1 Z*X2 + u_int
and include u_int in the second stage? and would the first stage regressions have to necessarily include X2 and X3?
Comment