Hello,
I am working with patient-level hospital data and estimating an IV/2SLS model where my endogenous variable is repeat_mr_same_day instrumented by low_quality. While doing this, in the second stage I want to keep the full set of controls like hospital type, MR utilization per machine, etc., but in the first stage I would like to exclude some of these controls like MR utilization, hospital type, because they have role in predicting the endogenous regressor.
When I use ivreg2 with the partial() option, the variables are only “partialled out” from the output but they still enter the first stage. So I want to ask:
Best Regards
Tuğba
I am working with patient-level hospital data and estimating an IV/2SLS model where my endogenous variable is repeat_mr_same_day instrumented by low_quality. While doing this, in the second stage I want to keep the full set of controls like hospital type, MR utilization per machine, etc., but in the first stage I would like to exclude some of these controls like MR utilization, hospital type, because they have role in predicting the endogenous regressor.
When I use ivreg2 with the partial() option, the variables are only “partialled out” from the output but they still enter the first stage. So I want to ask:
- Is there any way in Stata to estimate a standard IV/2SLS where certain controls are kept only in the second stage but not used in the first stage?
- If not, is the recommended approach to implement manual SLS (run the first stage with a reduced set of regressors, save the fitted values, then run the second stage with all controls), combined with cluster-robust bootstrap to get valid standard errors?
- Would sem or reg3 be a better framework if I want two equations with different right-hand-side variables, while still using cluster-robust SEs?
Best Regards
Tuğba
Comment