I want to estimate a DiD where there are several times points and where treatment does not occur at the same time for all treated units. Treatment is binary. Angrist and Pischke (2009) in MHE p. 233 or this post discuss differences between DiD with 2 time points and multiple time points. The proposed estimation is:
Yit = a + b Treat it + d Dit + t Year + e
Estimating this in Stata:
My question is: why/why not include individual fixed effects as well? Why not estimate:
Yit = a + b Treat it + d Dit + t Year + x ID + e
which would be an xtreg estimation:
Clearly the two estimations produce different results because one includes individual FE and the other does not. What is the motivation to choose one estimation over the other GIVEN the DiD framework?
Yit = a + b Treat it + d Dit + t Year + e
Estimating this in Stata:
Code:
webuse nlswork, clear gen treated = (idcode <= 2500) gen post = 0 if treated == 1 // treatment not appearing at the same time for everyone replace post = 1 if idcode <=500 & year >= 73 & treated == 1 replace post = 1 if idcode >500 & idcode <=1000 & year >= 71 & treated == 1 replace post = 1 if idcode >1000 & idcode <=2000 & year >=80 & treated == 1 replace post = 1 if idcode >=2000 & idcode <=2500 & year >=79 &treated == 1 reg ln_wage treated did i.year, cl(idcode)
Yit = a + b Treat it + d Dit + t Year + x ID + e
which would be an xtreg estimation:
Code:
xtset idcode xtreg ln_wage treated did i.year, fe cl(idcode)
Comment