Hello all,
I am working with an unbalanced panel dataset comprising 350 firms over a 20-year period (yearly observations), and I am facing potential endogeneity issues with three of my explanatory variables. Unfortunately, the instruments I have considered so far appear to be weak, as indicated by Cragg-Donald Wald F-statistics below the conventional threshold of 10. As a result, I am exploring the use of the control function approach to address these endogeneity concerns.
I would greatly appreciate your input on the following two questions:
1. Can the control function approach be used to address endogeneity in multiple endogenous regressors simultaneously in a panel data context with fixed effects? If so, how should the first-stage regressions be specified? Should each endogenous regressor be regressed on the full set of exogenous variables and instruments, or is there a more efficient specification?
2. What is the correct procedure to obtain residuals from the first-stage regressions (i.e., the control functions) when using fixed effects estimators in panel data (e.g., with xtreg, fe in Stata)? I tried using predict res_first, resid, but this doesn’t seem to work properly in the fixed effects context. Is there a preferred command or method to extract the residuals needed for the second-stage regression?
Thank you very much in advance!
Best regards,
Nick
I am working with an unbalanced panel dataset comprising 350 firms over a 20-year period (yearly observations), and I am facing potential endogeneity issues with three of my explanatory variables. Unfortunately, the instruments I have considered so far appear to be weak, as indicated by Cragg-Donald Wald F-statistics below the conventional threshold of 10. As a result, I am exploring the use of the control function approach to address these endogeneity concerns.
I would greatly appreciate your input on the following two questions:
1. Can the control function approach be used to address endogeneity in multiple endogenous regressors simultaneously in a panel data context with fixed effects? If so, how should the first-stage regressions be specified? Should each endogenous regressor be regressed on the full set of exogenous variables and instruments, or is there a more efficient specification?
2. What is the correct procedure to obtain residuals from the first-stage regressions (i.e., the control functions) when using fixed effects estimators in panel data (e.g., with xtreg, fe in Stata)? I tried using predict res_first, resid, but this doesn’t seem to work properly in the fixed effects context. Is there a preferred command or method to extract the residuals needed for the second-stage regression?
Thank you very much in advance!
Best regards,
Nick
Comment