Hi,
I have a country-year-panel dataset with T=5 and N=130 and I want to estimate a lagged dependent variable (LDV) model and compare it to an autoregressive distributive lag model (ARDL) and an autoregressive distributive lag model with a second lag of the LDV (ARDL_LDV2). Following Beck/Katz (2011) I want to mean-center all explanatory variables by year and country (in order to allow for year-and country specific intercepts) by simultaneously applying panel corrected standard errors.
To apply this, I first mean-center all variables (dependent and independent variables) by country and include year dummies in the OLS-regression on the deviations without an intercept.
According to William Goulds post on "Interpreting the intercept in the fixed-effects model":
Including one (exogenous) predictor indeed leads to the same coefficient but different standard errors. However by including another country-mean-centered predictor the coefficients of both variables do not equal the results from fixed effects estimation anymore. Similarly, replacing x2 with i.year also leads to these differences in coefficients and standard errors between the results of the fe-command and the country-mean-centered results.
Why is that the case?
My second question relates to the application of group-mean versus grand-mean-centering to estimate interaction effects:
Following Aiken/West (1991) I test interaction effects, which I grand-mean-center before entering into my regression model in order to reduce the issue of multicollinearity and make interpretation easier. Therefore, how can I combine removing unit heterogeneity by group-mean-centering (as suggested by Beck/Katz) and reduce multicollinearity by grand-mean-centering at the same time?
Any comments or suggestions are welcome!
Thanks a lot in advance!
I have a country-year-panel dataset with T=5 and N=130 and I want to estimate a lagged dependent variable (LDV) model and compare it to an autoregressive distributive lag model (ARDL) and an autoregressive distributive lag model with a second lag of the LDV (ARDL_LDV2). Following Beck/Katz (2011) I want to mean-center all explanatory variables by year and country (in order to allow for year-and country specific intercepts) by simultaneously applying panel corrected standard errors.
To apply this, I first mean-center all variables (dependent and independent variables) by country and include year dummies in the OLS-regression on the deviations without an intercept.
According to William Goulds post on "Interpreting the intercept in the fixed-effects model":
„(…) removing within-group means and estimating a regression on the deviations without an intercept (as given in equation 3) produces the same coefficients but different standard errors.“ [compared to xtreg, fe]
Comparing these group-mean centered OLS results (without constant) with the results of Stata’s official xtreg,fe command should according William Goulds post lead to the same estimates but different standard errors, because of the difference in equation 3 (group-mean-centering) and equation 5 (which is applied by Stata’s xtreg,fe-command) (see Gould's post).Code:egen double ybar = mean(y), by(ccode) egen double x1bar = mean(x), by(ccode) egen double x2bar = mean(x), by(ccode) gen yd = y-ybar gen x1d = x-x1bar gen x2d = x-x2bar xtreg y x1 x2, fe reg yd x1d, noconstant reg yd x1d x2d, noconstant reg yd x1d i.year, noconstant
Including one (exogenous) predictor indeed leads to the same coefficient but different standard errors. However by including another country-mean-centered predictor the coefficients of both variables do not equal the results from fixed effects estimation anymore. Similarly, replacing x2 with i.year also leads to these differences in coefficients and standard errors between the results of the fe-command and the country-mean-centered results.
Why is that the case?
My second question relates to the application of group-mean versus grand-mean-centering to estimate interaction effects:
Following Aiken/West (1991) I test interaction effects, which I grand-mean-center before entering into my regression model in order to reduce the issue of multicollinearity and make interpretation easier. Therefore, how can I combine removing unit heterogeneity by group-mean-centering (as suggested by Beck/Katz) and reduce multicollinearity by grand-mean-centering at the same time?
Any comments or suggestions are welcome!
Thanks a lot in advance!
Comment