I want to estimate the effect of d on y using a simple 2x2 DiD model with a balanced panel (I observe treatment and control group before and after treatment). I suspect that parallel trends are more plausible conditional of a time-varying covariate X, so I want to control for this. Running a TWFE regression (equation 1) with individual and time fixed effects and an indicator D equal to one when the treatment group is in the second period yields very different results than the "classical" implementation (equation 2). Without X, the results are the same. I know that time-varying covariates can be a problem if they affect treatment or vice versa, but independently, does anyone know or can explain why the estimation of D_it and treat*post are not the same? (In my case, the TWFE estimate is 10 times larger). I haven't found an explanation and I can't wrap my head around it
xtreg y i.individual + i.time + i.d + X, vce(cluster i.individual) (1)
xtreg y i.treat + i.post +(i.treat*i.post) + X, vce(cluster i.individual) (2)
xtreg y i.individual + i.time + i.d + X, vce(cluster i.individual) (1)
xtreg y i.treat + i.post +(i.treat*i.post) + X, vce(cluster i.individual) (2)
Comment