Dear all,
I have an unbalanced panel data set of N=240 entities over 17 years with nearly 1,200 observations.
Using Stata version 14.1, I am running system GMM (xtabond2) and a fixed effects (xtreg, fe) regression to determine the effect of x and its lagged values on y, but am getting contradictory results.
1) GMM
Here is my GMM syntax:
When running system GMM, I get the following relationship between my independent and dependent variable:
L0.x -> positive effect on y (p<0.10)
L1.x -> negative effect on y (p<0.01)
L2.x -> positive effect on y (p<0.05)
L3.x -> negative effect on y (p<0.10)
Number of instruments = 125, Number of groups = 145.
The model seems to be properly identified as:
AR(1): p=0.001, AR(2): p=0.172
Sargan test of overid. restrictions: chi2(95) = 303.22 Prob > chi2 = 0.000
(Not robust, but not weakened by many instruments.)
Hansen test of overid. restrictions: chi2(95) = 107.95 Prob > chi2 = 0.172
(Robust, but weakened by many instruments.)
If I exclude L2.y L3.y from the model, only the effect of L1.x on y remains significant.
Depending on whether I add or delete some control variables, L0.x-L3.x gain or lose on their significance.
I do not know whether this could play a role for GMM, but my independent variable in t is highly correlated with its own values in t-1 (correlations between 0.8 and 0.96). The same applies to the dependent variable (correlations between 0.95 and 0.99).
2) Fixed effects
To prove the robustness of my GMM results, I am running the same regression in fixed effects. For this purpose, I have created lagged values of my independent variable x (L1.x-L3.x) for fixed effects.
Here is my fixed effects syntax without control variables:
And here is my fixed effects syntax with control variables:
With or without my control variables, I am getting the following results:
L1.x -> positive effect on y (p<0.001)
L2.x, L3.x -> negative n.s.
If I am running individual regressions for each of the lagged values of independent variables (e.g. syntax: xtreg y L3.x, fe), I get a positive significant effect of each lagged value on y.
So whereas GMM suggests that the lagged values of x influence y positive in L0 and L2 and negative in L1 and L3, fixed effects show only positive effects of x and its lagged values on y.
Therefore I would like to kindly ask for your advice:
Thank you very much in advance!
Kind regards,
Maria
I have an unbalanced panel data set of N=240 entities over 17 years with nearly 1,200 observations.
Using Stata version 14.1, I am running system GMM (xtabond2) and a fixed effects (xtreg, fe) regression to determine the effect of x and its lagged values on y, but am getting contradictory results.
1) GMM
Here is my GMM syntax:
Code:
xtabond2 y L.y L2.y L3.y L(0/3).x cvar1 cvar2 cvar3 cvar4 cvar5 yr*, gmmstyle(L.y, collapse) gmmstyle(L.(x cvar1 cvar2 cvar3 cvar4 cvar5), collapse) ivstyle (yr*, equation(level)) twostep robust orthogonal
L0.x -> positive effect on y (p<0.10)
L1.x -> negative effect on y (p<0.01)
L2.x -> positive effect on y (p<0.05)
L3.x -> negative effect on y (p<0.10)
Number of instruments = 125, Number of groups = 145.
The model seems to be properly identified as:
AR(1): p=0.001, AR(2): p=0.172
Sargan test of overid. restrictions: chi2(95) = 303.22 Prob > chi2 = 0.000
(Not robust, but not weakened by many instruments.)
Hansen test of overid. restrictions: chi2(95) = 107.95 Prob > chi2 = 0.172
(Robust, but weakened by many instruments.)
If I exclude L2.y L3.y from the model, only the effect of L1.x on y remains significant.
Depending on whether I add or delete some control variables, L0.x-L3.x gain or lose on their significance.
I do not know whether this could play a role for GMM, but my independent variable in t is highly correlated with its own values in t-1 (correlations between 0.8 and 0.96). The same applies to the dependent variable (correlations between 0.95 and 0.99).
2) Fixed effects
To prove the robustness of my GMM results, I am running the same regression in fixed effects. For this purpose, I have created lagged values of my independent variable x (L1.x-L3.x) for fixed effects.
Here is my fixed effects syntax without control variables:
Code:
xtreg y L1.x L2.x L3.x, fe
Code:
xtreg y L1.x L2.x L3.x cvar1 cvar2 cvar3 cvar4 cvar5 yr*, fe
L1.x -> positive effect on y (p<0.001)
L2.x, L3.x -> negative n.s.
If I am running individual regressions for each of the lagged values of independent variables (e.g. syntax: xtreg y L3.x, fe), I get a positive significant effect of each lagged value on y.
So whereas GMM suggests that the lagged values of x influence y positive in L0 and L2 and negative in L1 and L3, fixed effects show only positive effects of x and its lagged values on y.
Therefore I would like to kindly ask for your advice:
- What could be the reason for the results discrepancy between GMM and fixed effects?
- Do you see any misspecification in my GMM or fixed effects models, which could be responsible for this discrepancy?
- What could I do to verify, which results are correct?
Thank you very much in advance!
Kind regards,
Maria
Comment