Dear Statalist community,
I have learned of my mistakes and learned to interprete interaction effects.
I performed a simple OLS regression with a two-way interaction effect, x1 and x2 both are dummy variables.
The interpretation of x1 and x2 estimates is the differences to firms who have the other attribute to zero:
Firms who have x1 produce 1.3 less Y1 than firms who have x1=0.
Firms who have x2 produce more Y1 than firms who have x2=0.
Firms whi have x1 and x2 set to one, produce more Y than firms with x1=0 and x2=1, or vice versa.
My question is now, does this suffice to reliably state that firms who have x2=1 provide more Y than firms with x2=0?
If I drop variables with x2=1 (0) and then look at means, I get the following result. Is it realiable to just calcualte differences between these two means, i.e. (-2.45- (2.51)?
How could I then show what influence the interaction with x1 has?
I have learned of my mistakes and learned to interprete interaction effects.
I performed a simple OLS regression with a two-way interaction effect, x1 and x2 both are dummy variables.
Code:
. reghdfe Y x1##x2 c1 c2 c3 c4 c5 c6 c7 c8 c9, absorb(FIRM Year) cluster(Industry) (dropped 27 singleton observations) (MWFE estimator converged in 7 iterations) HDFE Linear regression Number of obs = 487 Absorbing 2 HDFE groups F( 12, 21) = 63.45 Statistics robust to heteroskedasticity Prob > F = 0.0000 R-squared = 0.4268 Adj R-squared = 0.1758 Within R-sq. = 0.0595 Number of clusters (Industry) = 22 Root MSE = 0.7048 (Std. err. adjusted for 22 clusters in Industry) ------------------------------------------------------------------------------ | Robust Y | Coefficient std. err. t P>|t| [95% conf. interval] -------------+---------------------------------------------------------------- 1.x1 | -1.305025 .0995242 -13.11 0.000 -1.511997 -1.098053 1.x2 | .3349848 .1583003 2.12 0.046 .0057813 .6641883 | x1#x2 | 1 1 | 1.501429 .3299879 4.55 0.000 .8151814 2.187676 | c1 | -.0236591 .0586029 -0.40 0.691 -.1455305 .0982123 c2 | -.0256179 .0314577 -0.81 0.425 -.0910377 .0398018 c3 | .1965687 .1940449 1.01 0.323 -.2069697 .6001071 c4 | .0366049 .0571484 0.64 0.529 -.0822417 .1554514 c5 | .0089514 .0059729 1.50 0.149 -.0034699 .0213726 c6 | -.5765208 .5296322 -1.09 0.289 -1.677951 .5249097 c7 | .0045407 .0007158 6.34 0.000 .0030521 .0060293 c8 | .0104786 .0261328 0.40 0.692 -.0438676 .0648248 c9 | -.3780107 .2715036 -1.39 0.178 -.9426333 .186612 _cons | -5.923346 2.962192 -2.00 0.059 -12.08356 .2368706
Firms who have x1 produce 1.3 less Y1 than firms who have x1=0.
Firms who have x2 produce more Y1 than firms who have x2=0.
Firms whi have x1 and x2 set to one, produce more Y than firms with x1=0 and x2=1, or vice versa.
My question is now, does this suffice to reliably state that firms who have x2=1 provide more Y than firms with x2=0?
If I drop variables with x2=1 (0) and then look at means, I get the following result. Is it realiable to just calcualte differences between these two means, i.e. (-2.45- (2.51)?
How could I then show what influence the interaction with x1 has?
Code:
. drop if x2==1 (381 observations deleted) . univar Y -------------- Quantiles -------------- Variable n Mean S.D. Min .25 Mdn .75 Max ------------------------------------------------------------------------------- Y 133 -2.45 0.81 -4.41 -2.91 -2.34 -1.92 -0.59 . drop if x2==0 (133 observations deleted) . . univar Y -------------- Quantiles -------------- Variable n Mean S.D. Min .25 Mdn .75 Max ------------------------------------------------------------------------------- Y 381 -2.51 0.77 -4.53 -2.98 -2.43 -2.00 -0.56
Comment